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A STIMULUS-TRACE HYPOTHESIS FOR STATISTICAL 
LEARNING THEORY ! 


ROBERT S. WITTE? 


Stanford University 


A recent extension of the verbal 
conditioning paradigm to sequentially 
dependent reinforcements has exposed 
a limitation of statistical learning 
theory. This study attempts to 
remove that limitation by testing an 
extension of the theory. 

Statistical learning theory yields 
several deductions, two of which will 
be considered here, for the asymptotic 
behavior of Ss in verbal conditioning 
situations. According to the theory, 
Ss should predict a given reinforcing 
event about as frequently as the event 
occurs. A number of experiments 
(e.g., Estes & Straughan, 1954; Good- 
now & Postman, 1955) lend support 
to the “probability matching’’ deduc- 
tion. The theory also predicts a 
slight sequential dependency between 
Ss’ predictions on Trial m and the 
reinforcing event on Trial m — 1. 
Tests of this deduction (Anderson 
& Grant, 1957; Bottenberg, 1956; 


! This study is part of a thesis submitted 
to the Department of Psychology of Stanford 
University in partial fulfillment of the re- 
quirements for the Ph.D. degree. The 
writer wishes to thank Douglas H. Lawrence, 
who acted as sponsor, and William K. Estes, 
Richard C. Atkinson, and Quinn’ McNemar 
for their valuable suggestions and criticisms. 

2 Now at Cornell College, Mount Vernon, 
lowa. 


Estes & Straughan, 1954) have found 
larger sequential dependencies than 
forecast by the theory. 

Following the lead of Hake and 
Hyman (1953), Lawrence employed 
a verbal conditioning design in which 
successive reinforcements were se- 
quentially dependent.’ As required 
by the theory, his Ss predicted a 
given reinforcing event about as 
frequently as the event occurred, 
but they also structured their pre- 
dictions in a manner reflecting the 
sequential dependencies between rein- 
forcements. For example, in one of 
Lawrence's groups each of two rein- 
forcing events, £; and FE», occurred 
50% of the time. In addition, FE, 
followed E, 90% of the time, while 
E, followed EF, 10% of the time. The 
Ss working under this schedule not 
only predicted EF, 50% of the time, 
but they also predicted /; approx- 
imately 90% of the time on trials 
following E£,; occurrences. Except for 
the slight sequential dependency men- 
tioned above, the probability of EF, 
predictions on trials following /, oc- 
currences should approach .50. This 
was clearly not the case in the study 


* Unpublished study by D. H. Lawrence. 
Personal communication, July, 1956. 


273 





274 


by Lawrence or in more recently 
reported studies (Anderson, 1956; 
Engler, 1958). 

Inasmuch as the above experimen- 
ters used relatively massed trials, the 
stimulus confronting S on any trial 
may have been a complex one con- 
taining traces of previous responses 
and reinforcements. As suggested 
by Estes and Straughan (1954), 
assume that the stimulus includes a 
trace of the reinforcing event from 
the previous trial. Utilizing this 
trace assumption, Burke and Estes 
(1957) and the writer (1958) derived 
expressions for conditional and mar- 
ginal response probabilities which 
closely approximate the results ob- 
tained when sequentially dependent 
reinforcements were used. 

The extended model assumes that 
no overlap occurs between the stimu- 
lus elements which are present on 
trials following -,; and EF, events. 
If trial spacing is distributed, this 


assumption becomes untenable and 


the model must be modified. A rein- 
terpretation of Estes’ (1955) theoreti- 
cal formulation for spontaneous re- 
gression permits the prediction of the 
effect of trial spacing on conditional 
response probability. 


Three mutually exclusive sets of stim- 
ulus elements, S*,, S*;, and S*s, can be 
defined so that each set contains two 
mutually exclusive and exhaustive sub- 
sets, S, and S’,; S; and S’;; and S, and 
S's, respectively. Let S, represent the 
set of all available elements on a given 
trial, except for those available elements 
associated with the reinforcing event of 
the previous trial. Typically, 5S, is 
identified with postural and situational 
cues which are present at the start of any 
trial. Let S; represent the set of avail- 
able elements which supplements S, on 
trials following an E, occurrence, and let 
S, represent the set of available elements 
which supplements 5S, on trials following 
an E, occurrence. In the present model, 
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S, and S; are identified with those avail- 
able elements contributed by the hy- 
pothesized trace of the previous rein- 
forcement. 

For the purpose of this paper, one can 
assume that S, contains a constant num- 
ber of elements N, at the start of each 
trial, regardless of the size of the inter- 
trial interval. Now, however, assume 
that the N, elements in S, and the N, 
elements in 5S, fluctuate randomly be- 
tween S, and S’; and between S; and S’», 
respectively, during the intertrial in- 
terval. During a unit of time At follow- 
ing an E, occurrence, there exists a 
probability j,; that any element in S;, the 
available set, will escape into S’;, the 
unavailable set. During the same in- 
terval there exists a probability 7’; that 
any element in S’; will gointo S;. Then 
the general solution for F(t), the prob- 
ability that any element is in S, after the 
(t)th interval, is 


xX d—j—7s)' (1) 
where F(0) represents the value of F(t) 
when £, occurred. 

Since S; is identified with a stimulus 
trace, it is reasonable to assume that 7’; 
equals zero. Thus, Equation 1 reduces 
to 


Fi(t) = F,(0)(1 — j:)* [2] 
Defining N,(t) as the number of elements 
in S,; after the (¢)th interval and sub- 
stituting N,(t) for F,(t) in Equation 2, 
one obtains 

N,(t) = Ni(1 —_ ji) [3] 
Then p:;(n,t), the probability of an EZ, 
response after Trial m, given an E, occur- 
rence of Trial m and an intertrial interval 
of size t, can be expressed as 
ae ea 
Net Ni(t) 

X {NL (1—6) Fo(n— 1) +6] 

+N, ()((1—Ol) Fi(n—1)+611)} (4) 


pPii(n,t) = 
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where @ is a sampling parameter; II; is 
the probability of an E, event following 
an E,; event; F,(m— 1) is the prob- 
ability that the ith element in S, is 
conditioned to the EZ; response class after 
Trial nm — 1; and F,(m — 1) is the prob- 
ability that the ith element in S; is 
conditioned to the E; response class after 
Trial » — 1, given an E, occurrence on 
Trial n — 1. 

When rewritten in terms of asymptotic 
behavior, Equation 4 takes the form 


1 
Ne + Nilt) 
x {N,.[Il + @(1 — ID] 
4+ Ni()(Mhn + 61,1 — 1111) J} (S] 


where II is the probability of an E; event, 
and the terms in the square brackets now 
represent the asymptotic values of 
F.(n — 1) and F,(m— 1), plus small 
” “correction”’ terms required by the con- 
sistent incremental effect of the previous 
FE, occurrence. According to Equation 
5, when N,(t) is comparatively large, 
conditional response probability ap- 
proaches II,, plus a term smaller than @ in 
magnitude, and when WN,(t) is com- 
paratively small, it approaches II plus a 
small term less than @ in magnitude. 

An expression analogous to that for 
Pi(@ ,t) can be obtained for Pale i. 
the asymptotic probability of an EF, 
response after Trial m, given an EF, 
occurrence on Trial m and an intertrial 
interval of size t: 


1 
Ne + N2(t) 
xX {Nfl — 611] 

+ N2(t)(IT 21 — A112, (1 oan TT 21) } [6] 
where II» is the probability of an E, 
event following an EE, event. Except 
for a change in notation and in the cor- 


rection terms, Equation 6 has the same 
properties as Equation 5. 


pPii(®,t) = 


pale ,t) = 


The remaining part of this paper 
describes an attempt to test the trace- 


fluctu’ntion 
intertrial 


model by 
interval and 


varying the 
noting its 
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effect on conditional response prob- 
ability. 


METHOD 


Design.—Practical considerations dictated 
the use of rats as Ss; E anticipated much 
difficulty in motivating college sophomores 
to participate in that phase of the experiment 
requiring highly distributed trials. Fortu- 
nately, the identifications between theory and 
data for the verbal conditioning situation 
can be transposed readily to the T-maze 
situation with rats. More specifically, a 
food reward in the left or right goal box was 
identified with an E, or Ez event, while a left 
or right turn at the choice point was identified 
with an E£, or E, response. 

By definition, the intertrial interval began 
when S was lifted from the goal box, and it 
ended when S reached the choice point on the 
following trial. Since a pilot study indicated 
that the time required by most Ss to travel 
from the starting box to the choice point 
stabilized at about 2 sec., the size of the 
intertrial interval could be predetermined 
within a small margin of error. Each S was 
assigned permanently to one of four intertrial 
intervals: 8 sec., 30 sec., 3 min., or 1 hr. 

All Ss were rewarded equally often on 
both sides of the maze. One-half of the 
experimental Ss worked under a “repetition” 
reward schedule in which E, followed E, 90% 
of the time, while E, followed E, 10% of the 
time. To compensate for possible spon- 
taneous alternation tendencies (Dember 
& Fowler, 1958), the other one-half of the 
experimental Ss worked under an “alter- 
nation” reward schedule in which E, followed 
E, 10% of the time, while E, followed E, 90% 
of the time. As an additional check on 
alternation tendencies, a control group 
worked at the 8-sec. intertrial interval under 
a reward schedule in which EF, followed EZ, 
and E, equally often. Henceforth, the 
various groups of experimental Ss will be 
designated as 8-R, 8-A, etc., where the num- 
ber refers to the size of the intertrial interval 
in seconds and the letter refers to the reward 
schedule. é 

Apparatus.—The apparatus, a simple T 
maze, consisted of a gray stem, 28 in. long, 
and two black alleyways, each 36 in. long. 
All sections of the maze were 4 in. wide and 
4 in. high. The stem contained two guillo- 
tine gates, one at the starting box and the 
other at the choice point. In each alleyway, 
at a distance of 21 in. from the choice point, 
a hinged door separated the alleyway from 
the goal box. These doors could be locked by 
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concealed levers. The levers were operated 
mechanically from E's stand at the base of 
the maze. 

Approximately 3 ft. above the maze, a set 
of eight 100-w. bulbs were located as follows: 
one bulb above the starting gate, another 
above the choice point, and the remaining 
were arranged symmetrically above the two 
branch alleyways. No other light source was 
present in the experimental room. 

The intertrial interval for the 8- and 30- 
sec. groups was timed to the nearest second 
with an automatic recycling timer. To 
facilitate the accuracy of this measurement, 
button switches controlling the timer were 
located adjacent to the goal boxes. The 
interval for the 3-min. group was timed with 
a stop watch to the nearest 5 sec., and that 
for the 1-hr. group was timed with a wall 
clock to the nearest minute. 

Procedure.—During the first six days of a 
12-day training period, Ss were gentled and 
introduced to a 224-hr. food privation sched- 
ule. During the last six days of the training 
period, Ss explored each of the two alleyways 
of the T maze on alternate days, learning to 
run through the goal box door and locate a 
wet mash reward in the foodwell. 

After the training period, trials in the T 
maze were begun. An additional 12 days 
elapsed before Ss’ running times permitted the 
desired five trials per day. On each trial, 
after S had been placed in the starting box, 
the gates at the starting box exit and at the 
choice point were raised. When S's entire 
body penetrated a given alleyway, these 
gates were lowered, and one of the goal box 
doors was unlocked according to a predeter- 
mined schedule. If S ran to the side con- 
taining the locked door, he was forced to 
return to the choice point and proceed to the 
other side. After S had eaten the mash 
reward, weighing approximately .3 gm., he 
was lifted from the goal box, and the goal 
box door was relocked. The Ss in the 8-sec. 
group were returned immediately to the 
starting box and run for the next trial. The 
Ss in the remaining groups were placed in 
individual detention cages for prescribed 
periods of time before receiving the next 
trial. 

To further motivate a few Ss who occasion- 
ally froze in the maze, all Ss who refused to 
run on any trial lost their 14-hr. free-feeding 
privileges for that day. This technique was 
required only during the first three days of 
the experiment. Ordinarily, food depriva- 
tion at running time ranged between 18 
and 22 hr. All groups participated in a 
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counterbalanced design yielding an average 
of 20-hr. food deprivation at running time. 

The Ss in the control group received a 
total of 200 trials. In an attempt to give 
all experimental Ss an ample series of trials 
in which to discriminate the sequential 
dependencies between reinforcements, the 
experimental Ss at the 8- and 30-sec. intervals 
received 100 trials more than did the control 
Ss, while Ss at the 3-min. and 1-hr. intervals 
received 200 trials more than did the control 
Ss. 

The reward schedule was predetermined 
separately for each S over blocks of 100 
trials. These schedules were constructed 
from tables of random numbers with the 
restriction that the reward sequence reflected 
within a margin of 2% the conditional prob- 
abilities specified in the experimental design. 

Subjects—The Ss were 171 albino rats 
between 90 and 140 days of age at the start 
of the experiment. Each of the groups in 
the study received approximately propor- 
tionate allotments of the 120 males and 51 
females. Twelve fatalities occurred during 
the experiment. This high mortality rate 
was primarily caused by respiratory complica- 
tions, first detected during the training period 
and subsequently treated with sulfathiazole. 
Table 3 shows the number of Ss in each 
group at the end of the experiment. 


RESULTS 


Unless noted otherwise, all analyses 
were based on responses from the 
last four trials of each day, and all 
estimates of asymptotic performance 


were derived from the final two 
blocks of 40 trials each. To deter- 
mine whether an asymptote had been 


. reached, the mean difference between 


these trial blocks was tested for each 
group, but the test is mentioned ex- 
plicitly only when significant.‘ Since 
Ili; = Tee and Ilex = Iie in this 
study, more reliable estimates of 
conditional response probability were 
obtained by calculating pii(m,t), an 
average of the estimates ~1:(m,t) and 
Poo(n,t), and by calculating ;;(n,1), 
an average of the estimates p2:(m,/) 
and pi2(n,t). 


‘In this paper the .05 level is adopted as 
the minimum level for significance. 
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TABLE 1 


TERMINAL ESTIMATES OF CONDITIONAL RESPONSE PROBABILITY 











Intertrial Interval 











All Ss 

Nonfixated Ss 

Nonfixated Ss, 2nd 
trial only 








* Probability measure is pii( @, 4). 
> Probability measure is py(@, ¢). 
¢ The difference between the final two trial blocks was significant (P < .001). 


During the first 40 trials, condi- 
tional response probability clustered 
about .50 for all experimental groups, 
except Groups 8-A and 30-A. As 
will be seen, the corresponding esti- 
mates of .74 and .63 for Groups 8-A 
and 30-A were probably inflated by 
alternation tendencies. All experi- 
mental groups, except 8-R, appeared 
to reach an asymptote between 80 
and 160 trials. No relationship was 
detected between rate of asymptotic 
approach and the experimental condi- 
tions. The above trends also char- 
acterize the performance of the sub- 
class of ‘‘nonfixated’’ Ss, to be 
defined later. 

Terminal estimates of conditional 
response probability for all experi- 
mental Ss are given in Table 1. 
Even though the estimate for Group 
8-R jumped from .69 to .77 during 
the final two trial blocks, it is retained 
as the best available estimate of the 
asymptote for that group. 

Since, for all groups beyond the 8- 
sec. interval, the distributions asso- 
ciated with the terminal estimates 
were highly skewed with heterogeneous 
variances, the main effects were 
tested by applying the extended 
median test for k independent samples 
(Siegel, 1956). The significant inter- 
trial interval effect (P < .001) was 





interpreted as qualitative evidence 
in favor of the theory. The signifi- 
cant reward schedule effect (P < .01) 
indicated that conditional response 
probability was higher under the 
repetition schedule than under the 
alternation schedule. However, the 
significance of these effects, particu- 
larly that of the latter, must be 
viewed cautiously since the inter- 
action probably was great. 

According to statistical learning 
theory (Estes, 1954), when II = .50, 
as it does for all Ss in this study, 
individual estimates of p,(«), the 
asymptotic probability of an F, 
response, should approximate a binom- 
ial distribution about .50. Instead, 
the obtained dispersion described a 
tri-modal curve with slightly more 
than 50% of the estimates deviating 
beyond the 99% acceptance interval 
for .50 (see Table 2). Approximately 


TABLE 2 


TERMINAL ESTIMATES OF ~—;(~) 
VALUES FOR ALL Ss 


pil) | No. of Ss I} No. of Ss 
.00 18 || .56-.66 20 

OAL | “| ‘67-77 | 14 

12-.22 | || .78-.88 | 4 

.23-.33 89-99 | 6 

34-44 

A5—.55 


pil @) 











TABLE 3 


FREQUENCY OF FIXaTED Ss 
(Total number of Ss in parentheses) 





Intertrial Interval 








Reward eee 
Schedule 
8 Sec. 30 Sec. | 3 Min. | 60 Min. 
Repetition 1 (133 2 (37) 8 (15) | 12 (15) 
Alternation 2 (18 17 (20) | 19 (20) | 18 (18) 














Note.—Of the 19 Ss in the control group, 3 were 
fixated. 





58% of the 159 Ss had pi(«) values 
larger than .50. According to the 
binomial test, this split was significant 
(P < .05), indicating an over-all group 
preference for the left side of the maze. 
Several trends among the Ss that 
deviated beyond the 99% acceptance 
interval for .50, henceforth called 
‘fixated”’ Ss, warrant mention. Most 
position habits appeared early in 
the study: 50% of the fixated Ss 
developed position habits during the 
first 40 trials; 75% during the first 
80 trials; and 95% during the first 
160 trials. According to the chi- 
square test, position preferences oc- 
curred most frequently among Ss under 
the alternation schedule (P < .001) 
and under highly distributed trials 
(P < .001) (see Table 3). 
' The fixated Ss complicate any 
quantitative test of the trace-fluctua- 
tion model since conditional response 
probability is suppressed toward .50 
whenever strong position preferences 
occur. Consequently, if the model 
were to be tested fairly, its application 
was necessarily limited to Ss having 
no pronounced position habits. To 
insure this, all fixated Ss were ex- 
cluded from the theoretical analysis.® 
Although this exclusion eliminated 
more than one-half of the Ss, it 
yielded adjusted mean estimates of 
pi(*) which differed only slightly 
from .50. Furthermore, it led to 
5 To facilitate comparison, the three 


fixated Ss in the control group were also 
excluded from subsequent analyses. 
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sets of individual estimates which 
approximated normal distributions, 
permitting the use of parametric 
statistics. 

For the nonfixated Ss in the control 
group, estimates of ;;(n,t) were 
employed as measures of reward 
alternation tendencies, and estimates 
of pA;;(n,t), the probability of an EF; 
response after Trial m, given an EF; 
response on Trial m, were employed 
as measures of response alternation 
tendencies. When successive rewards 
are independent, the theory predicts 
that ~;;(n,t) should approach II minus 
a small correction term and that 
pij(n,t) should approach II minus 
an indeterminate, but presumably 
small, correction term. The terminal 
estimate for p;;(,t) was .72, and that 
for pAi(n,t) was .63. Both esti- 
mates differed significantly (P < .001) 
from II, suggesting the presence of 
strong reward and response alter- 
nation tendencies. 

The frequency with which the 
control Ss alternated relative to both 
the previous reward and _ response 
remained fairly constant throughout 
the experiment. To determine the 
relationship between terminal esti- 
mates of reward and response alter- 
nation tendencies, the correlation 
ratio, eta, was calculated. The ob- 
tained value of .14 differed insignifi- 
cantly from zero; no consistent re- 
lationship existed between the terminal 
estimates of reward and _ response 
alternation for individual Ss. 

The terminal estimates of condi- 
tional response probability for the 
nonfixated experimental Ss are given 
in Table 1. The variances associ- 
ated with these estimates were .009, 
.008, .019, .003, .004, and .005 for 
Groups 8-R, 30-R, 180-R, 3600-R, 
8-A, and 30-A, respectively. Bart- 
lett’s test gave a nonsignificant result 
when applied to this data. The 
variance model used in the two- 











way analysis is the one which can 
be applied when data are missing 
completely from a given subclass 
(Snedecor, 1946). 

The main effects need not be quali- 
fied since the interaction was not sig- 
nificant. Once again, the significant 
intertrial interval effect (P < .001) 
was in the direction predicted by the 
theory. The largest decrease in con- 
ditional response probability occurred 
between the 8- and 30-sec. intervals 
(P < .02). None of the other de- 
creases between adjacent intervals 
was significant. 

The significant reward schedule 
effect (P <.01) was now in the 
direction anticipated by the experi- 
mental design, i.e., conditional re- 
sponse probability was higher under 
the alternation schedule than under 
the repetition schedule. At the 8-sec. 
interval, conditional response prob- 
ability under the alternation schedule 
was significantly higher (P < .001) 
than that under the repetition sched- 
ule. None of the differences at 
either the 30-sec. or the 3-min. 
intervals was significant. 

The significant difference at the 
8-sec. interval probably can be attrib- 
uted to the inflationary and depressive 
effects of alternation tendencies on 
pij(~,t) and pii(*,t), respectively. 
It was assumed that any alternation 
tendencies would have the same 
magnitude of effect on both estimates. 
Since these effects would be in oppo- 
site directions, pjx(n,t), an unbiased 
estimate of the probability of an 
E; response following an £; occur- 
rence, could be evaluated by averag- 
ing estimates of pii(m,t) and p;;(n,t) 
obtained under the repetition and 
alternation schedules. 

A more rigorous test of the model 
required that Equations 5 and 6 
be averaged to obtain an expression 
for pix(~,t). If, when the sequen- 
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tial dependencies between rewards 
are symmetrical, it is assumed that 
jt = je = jj and MN, = N2 = Ni, the 
resulting expression for p\,(,t) has 
the same form as Equations 5 and 6, 
except for the disappearance of the 
correction terms. 

Given the above conditions, one 
could estimate the theoretical param- 
eters j; and N, from the data at two 
intervals and generate exact predic- 
tions for pix(*,/) at the remaining 
two intervals. After j; = .01 and 
N./N; = .22 had been estimated from 
the 8-sec. and 3-min. intervals, pre- 
dictions were generated for the 30-sec. 
and 1-hr. intervals. As can be seen 
in Table 4, the discrepancies between 
obtained and predicted values were 
rather large. 

To test these discrepancies, weighted 
means were substituted for the un- 
weighted estimates of pix(~,t). Since 
the revised estimates differed only 
slightly from those appearing in Table 
4, the previously estimated values of 
ji and N,./N; were retained. Al- 
though still large, the differences 
between obtained and predicted val- 
ues did not lead to significant ¢ ratios. 

The slow decay rate of the trace 
stimuli, as suggested by the estimate 
of .01 for ji, implies that traces from 
several previous rewards may have 
been present on the 3rd, 4th, and 
5th trials for each day. To partial 


TABLE 4 


OBTAINED AND PREDICTED CONDITIONAL 
RESPONSE PROBABILITIES FOR THE 
NONFIXATED SS 








| 
| Intertrial Interval 
Data pix(@,t) | 
| 30 3 60 
Sec. | Sec. | Min. | Min. 
Responses on Obt. | 82° | .70 | .64* | .57» 
Trials 2-5 Pred. | 82 BO | 64 | SO 
Responses on Obt. | .81* | .68 | .63* | 49» 
Trial 2 Pred. | 81 79 | 63 50 
| | | 





* Data used to estimate theoretical parameters. 
>» Weighted mean. 
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out these effects, the responses of the 
nonfixated Ss on the second trial 
were analyzed for the last 20 days of 
the experiment. (This period cor- 
responded to the final 80 trials.) 
None of the previously nonfixated Ss 
fell in the fixated category as a result 
of this reanalysis. The revised ter- 
minal estimates for the nonfixated Ss 
are listed in Table 1. 

After jj = .01 and N,./N; = .27 
had been estimated from the revised 
data at the 8-sec. and 3-min. intervals, 





1 


Ne + | Nit + be) 
k=0 
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predictions of conditional response 
probability were made again for the 
30-sec. and 1-hr. intervals. Although 
the discrepancy between obtained 
and predicted values at the 30-sec. 
interval remains large, that at the 
1-hr. interval is now smaller, as can 
be seen in Table 4. 


The model can be extended to account 
for the presence of more than one trace. 
For subsequences of similar events under 
the repetition schedule, the necessary 
modification of Equation 5 reads 


—— {1+ Fea - ma — 1, 
k=0 


z—1 Fd 
+ (Mi + Yo OM (1 — My) (1 — OM) *] Ni(t+ ke)} [7] 
k=O k=0 


where c equals the time required for a 
complete trial cycle and x equals the 
number of available traces on any given 
trial. According to Equation 7, as the 
number of available traces increases, con- 
ditional response probability approaches 





1 


pii(% t,x) = —— 





II; plus a growing, but still small, cor- 
rection term. 

For subsequences of alternating 
events under the alternation schedule, 
the necessary revision of Equation 6 
reads 


N. + - Nit + 2kc) + > Ni{t 4. (1 + 2m)c ] 
k=0 m=0 


x (ot — FE ona — 0)§ + © oa — ma —6)"qw, 
k=O m=( 


y-l = 
+ (My — ¥ omy’ — My) — 6 + om) *) = Ni(t + 2kc) 
k=O k=O 


z—1 
+(Mli+ DO OM — 
m=O 


where, when x = 1, Equation 6 is ap- 
propriate ; when x is odd and greater than 
one, y = 4(x+1) and z = 4(x — 1); 
and when x is even, y = 2 = $x. Ac- 
cording to Equations 6 and 8, conditional 
response probability is a cyclical function 
of the number of available traces.® 


* Straightforward modifications of Equa- 
tions 7 and 8 yield predictions for conditional 
response probability following any sub- 


M)(1 — 6u)"] | Niet (1+ 2m)c]} [8] 


m=0 


To test the predictions of Equations 
7 and 8, the small correction terms 
were ignored, and ¢ was set equal to 
the intertrial interval plus 8, where 
the latter quantity is the average 





sequence of events. In this study, however, 


no attempt was made to check the theory 
against the remaining subsequences since 
they occurred infrequently. 
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TABLE 5 


OBTAINED AND PREDICTED CONDITIONAL 
RESPONSE PROBABILITIES FOR THE NON- 
FIXATED Ss AS A FUNCTION OF THE NUMBER 
oF PREvious REWARDS ON A GIVEN Day 
DuRING THE Last 20 Days OF THE EXPERI- 
MENT 



































Prob- 
ability Previous Rewards 
Measure 
Group 
pii(@, t) Ei | Eiki | EiBik; | EiEiBiki 
&R | Obt. | 08 | 81 82 84 
Pred. | 68 | .75 78 .80 
30-R Obt. | 69 | 82 82 83 
Pred. | .68 74 .78 80 
pi(@,t) | Ri Ei\kj E\ BE; | Ei Bj Eiki 
8-A Obt. 94 91 1 86 
Pred. | 90 | (50 | (68 54 











* Data used to estimate theoretical parameters. 


time in seconds required by Ss to 
run from the choice point to the goal 
box, and to eat the mash reward. 
Predictions were limited to the com- 
paratively large number of nonfixated 
Ss in Groups 8-R, 30-R, and 8-A. 
The estimates j; = .001 and N,/N; 
= 1.21 for Equation 7 were obtained 
from the revised terminal estimates 
for Groups 8-R and 180-R, shown at 
the bottom of Table 1. Predicted 
and obtained values for Groups 8-R 
and 30-R are given in Table 5 as a 
function of the number of previous 
rewards on a given day during the 
last 20 days of the experiment. (It 
should be noted that the number of 
observations on which the estimates 
in Table 5 were based decreased as the 
subsequence became longer. Thus, 
for Group 8-R, the estimate following 
E; was based on 320 observations, 
while the remaining estimates follow- 
ing E\E;, Ei\E\E;, and E\E\E\E; were 
based on 286, 256, and 227 observa- 
tions. The number of observations 
decreased at approximately the same 
rate for Groups 30-R and 8-A. 
Except for slight variations, all Ss 
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within a group contributed an equal 
number of observations to each of the 
estimates for that group.) Although 
the predicted values consistently un- 
derestimated the obtained values for 
Groups 8-R and 30-R in Table 5, 
the predicted trend occurred, and 
most of the discrepancies were fairly 
small, suggesting the presence of 
traces from at least two, and perhaps 
as many as four, preceding rewards.’ 

Estimates of j; and N,./N; could 
not be made for Equation 8 since the 
revised terminal estimate of .94 for 
Group 8-A was larger than the pre- 
dicted asymptote. Nevertheless, to 
provide a rough indication of the 
predictions generated by Equation 
8, it was assumed that j;=.001 and 
N./N;=.01. As-can be seen in Table 
5, not only are the discrepancies be- 
tween obtained and “‘predicted”’ values 
very large, but the predicted trend is 
not apparent. 

Finally, the theory can be checked 
against an additional data point at 
the 18- to 22-hr. interval. When the 
terminal estimates of conditional re- 
sponse probability were derived from 
responses on the first trials of the last 
20 days of the experiment, only 25 
of the 140 experimental Ss fell in the 
nonfixated category. The terminal 
estimate of .53 for these Ss did not 
differ significantly from the predicted 
value of approximately .50. 


DISCUSSION 


In spite of precautions taken to elimin- 
ate preferential cues or treatment in the 
T maze, the Ss collectively demonstrated 
a significant preference for the left side of 
the maze. Nevertheless, the high inci- 
dence of fixated Ss probably cannot be 


7 Since all of these discrepancies were in 
the same direction as the growing correction 
term, the fit between theory and data for 
Groups 8-R and 30-R is slightly closer than 
indicated in Table 5. 





282 


attributed solely to peculiarities of the 
experimental situation, since 30 Ss had 
position habits toward the right side of 
the maze. These results, when coupled 
with similar findings reported by Estes 
(cited by Bitterman, Wodinsky, & Cand- 
land, 1958) for rats and by Anderson and 
Grant (1957) for humans, suggest that 
the prediction of statistical learning 
theory for individual variability may 
need revamping. 

Why did the incidence of fixated Ss 
rise with increases in trial spacing? Per- 
haps strong response and reward alterna- 
tion tendencies interfered with the 
development of position habits. On the 
basis of previous studies (Dember & 
Fowler, 1958), it is reasonable to assume 
that the frequency of alternation de- 
creased as trial spacing became more 
distributed. Consequently, the develop- 
ment of position habits was no longer 
obstructed at the larger intervals. 

Why did the fixated Ss under the 
alternation schedule outnumber those 
under the repetition schedule? The Ss 
with position habits under the alterna- 
tion schedule received continuous partial 


reinforcement, while their counterparts 
under the repetition schedule received 
alternate periods of regular reinforcement 


and nonreinforcement. Presumably 
then, the position habits of the Ss under 
the repetition schedule should have been 
more susceptible to extinction. Accord- 
ing to this explanation, the responses of 
these Ss should lag behind shifts in rein- 
forcement from one side of the maze to 
the other, while the previously reinforced 
position habit was being extinguished. 
This lag tended to occur among the non- 
fixated Ss under the repetition schedule. 

In accordance with the theory, condi- 
tional response probability approached 
.50 as trial spacing increased. On the 
other hand, exact predictions of condi- 
tional response probability, particularly 
at the 30-sec. interval, did not accurately 
reflect the obtained values. Conceiv- 
ably, much of the discrepancy between 
theory and data could have been caused 
by the selective elimination of fixated Ss 
as the intertrial interval increased. To 
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avoid selectivity, an alternative design 
could be employed. Instead of being 
assigned permanently to a given interval, 
Ss could be rotated through all intervals 
after reaching the ‘“‘asymptotic’”’ series; 
even if it were necessary to make a 
separate analysis based on the nonfixated 
Ss, the same Ss would now be represented 
at all intervals. 

The agreement between obtained and 
predicted conditional response prob- 
abilities, as a function of the number of 
previous rewards, was fairly good for 
Groups 8-R and 30-R, but poor for 
Group 8-A. Although no attempt was 
made to determine precisely whether 
only two, or as many as four, preceding 
rewards affected the responses of the Ss 
under the repetition schedule, the present 
results resemble Anderson’s (1956) find- 
ings that the responses of human Ss were 
affected by three preceding rewards, and 
for at least one group, by four preceding 
rewards. For most groups in the study 
by Anderson, the mean probability of a 
repetition response, as estimated from 
the last 100 trials of a 300-trial acquisi- 
tion series, was an increasing function of 
the length of subsequences of similar 
events. As noted by Anderson, how- 
ever, the form of the function could have 
been caused by a growing correction term 
rather than by an accumulation of traces, 
since the slope of the function was slight 
in most cases. In the same study, ter- 
minal behavior following subsequences of 
alternating events reflected neither the 
cyclical trend specified by Equation 8 
nor the performance of Ss in the present 
study. More specifically, for all groups 
except one, the mean probability of an 
alternation response increased as a func- 
tion of subsequence length. Although at 
odds, the findings of both studies suggest 
that Equation 8 may be inadequate, but 
before discarding it, at least the finding 
for Group 8-A should be confirmed at a 
few larger intervals where the confound- 
ing effect of strong alternation tendencies 
should be negligible. Such a confirma- 
tion was not attempted in this study due 
to the high incidence of fixated Ss in 
Groups 30-A, 180-A, and 3600-A. 
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SUMMARY 


According to the trace-fluctuation model, 
terminal estimates of conditional response 
probability should match the sequential 
dependencies between reinforcements only 
when trial spacing is massed. When trial 
spacing is distributed, these estimates should 
approach a value of approximately .50. 

The Ss were 159 rats, each of which re- 
ceived from 200 to 400 trials in a simple T 
maze. Each S was assigned permanently 
to one of four intertrial intervals: 8 sec., 
30 sec., 3 min., or hr. To adjust for possible 
alternation tendencies, one-half of the experi- 
mental Ss worked under a “repetition” 
reward schedule, while the other half worked 
under an “alternation’’ reward schedule. 
Also a control group worked under a sequen- 
tially independent reward schedule. 

As trial spacing increased, terminal esti- 
mates of conditional response probability 
for all experimental Ss approached .50. 

More than 50% of all Ss developed position 
preferences. These preferences occurred most 
frequently among Ss working under the 
alternation schedule at the larger intertrial 
intervals. All Ss with strong position habits 


were excluded from subsequent analyses. 
Alternation occurred with respect to both 


the previous response and the previous 
reward. No consistent relationship existed 
between these tendencies for individual Ss 
in the control group. 

As trial spacing increased, terminal esti- 
mates of conditional response probability 
for all Ss without¥position habits also ap- 
proached .50. Exact predictions of condi- 
tional response probability did not accurately 
reflect the obtained values, particularly at 
the 30-sec. interval. 

A reanalysis of the data suggested that Ss 
under the repetition schedule were responding 
in terms of traces from at least two, and 
perhaps as many as four, preceding rewards. 
The agreement between theory and reanalysed 
data was fairly good for selected groups under 
the repetition schedule, but poor for Ss 
under the alternation schedule at the 8-sec. 
interval. 
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A number of experiments have 
been conducted to determine the 
effects that variations in magnitude 
of reward or in percentages of re- 
sponses rewarded have on rate of 
learning and final level of performance. 
Several experimenters have studied 
the interaction of these variables with 
human Ss (Edwards, 1954; 1956). 
However, there are no reports of 
such experiments with animal Ss 
although the value of such studies 
has been pointed out by Jenkins and 
Stanley (1950, p. 200). 

The general problem with which 
we are concerned is the question of 
whether or not rats’ responses can be 
predicted from the assumption. that 
following extensive training behavior 
will be of whatever sort maximizes 
reward over a series of trials. 

Precise definitions of critical terms 
will be presented now to facilitate 
description of the specific problems 
investigated and discussion of the 
results obtained. 

The term reward refers to a goal 
object and magnitude of reward is 
some physical measure of the amount 
of the goal substance (e.g., grams of 
food). Mean reward is the average 
amount of the goal object presented 
per response including nonrewarded 
responses. If only certain response 
occurrences are rewarded and _ the 
reward is the same following all 
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rewarded responses, magnitude of re- 
ward refers to its magnitude on those 
trials which are rewarded. Thus, 
with 50% of the responses rewarded 
(ordinarily described as 50% rein- 
forcement) mean reward is half the 
magnitude of reward. 

Reinforcement is the strengthening 
effect of stimuli on antecedent re- 
sponses. A more precise statement 
becomes somewhat involved. Con- 
sider two mutually exclusive choice 
responses, Ra and Rg, which would 
occur equally often (within chance 
limits) had they been regularly fol- 
lowed by the same event. If both 
responses occur many times to the 
same stimulus, each being followed 
by a different event, and following 
this training Ra occurs regularly 
while Rg no longer occurs, then the 
event (X,4) which follows Ra is (by 
definition) more reinforcing than Xz. 
Similarly, if the events following one 
or both of the responses differ from 
one occurrence of a response to the 
next and Ra comes to occur regularly 
(or predominantly), the mean rein- 
forcement following Rg is greater than 
the mean reinforcement following Rs. 

The present study was designed to 
determine which of two responses, if 
either, will become dominant after 
extensive training when mean reward 
is the same following both responses 
but magnitude of reward, and thus 
percentage of responses rewarded, is 
different. Each group of Ss received 
one unit of reward in one arm of a 
maze on every run to this side; on 
the other side they received two units 
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TABLE 1 
SUMMARY OF DIFFERENCES {IN PROCEDURES FOR THE Four Groups 


roup | E 





| 

| AML 
REM 
EFM 
EFM 





of reward on half the runs and no 
reward on the others. Two groups 
ran under water deprivation for water 
reward and two under food depriva- 
tion for food reward. Mean reward 
differed for the two groups run under 
each deprivation condition. If mean 
reward is the sole determiner of mean 
reinforcement, no significant prefer- 
ence for either response would be 
acquired by Ss in any group. 


METHOD 


Since the work reported here’ was done 
as part of three separate M.A. theses, all 
groups were not run by the same E, and 
certain characteristics of apparatus and Ss 
were not exactly the same. These differences, 
which are summarized in Table 1, are as- 
sumed to have had no measurable effect on 
the general nature of the results. Thus, 
for the most part, this report is in the form 
that would be used ‘if all Ss had been run 
under conditions that were identical except 
for deprivation and reward conditions. 

Subjects—Data are reported from 50 
naive albino rats from the colony maintained 
by the Kent State University Psychology 
Department. Age of Ss was 83-180 days at 
the beginning of training. The distribution 
of the Ss in the different groups is summarized 
in Table 1. Seven additional Ss were dis- 
carded from Group FL owing to errors on the 
part of E. Their performances were typical 
up to the time they were discarded. One 
S was discarded from Group WS when it 
failed to meet a predetermined criterion of 
making a choice within 20 min. subsequent 
to Day 4. 

Apparatus.—Three single choice point E- 
shaped alley mazes were used. All mazes 
had distinctively different goal boxes, and 
the stem and each arm were painted differ- 


Daily Trial Block 
Reward Unit 
4 “ 


Da. 1-10 | Da. 11-25 .25 gm. 

Da. 1-2 | Da. 3-36 | | 1.0 gm. 
| Da. 1-28 | 2 drops 
Da. 1-28 110 = drops 


| 


ently. Alleys in each maze were 4.5 in. wide 
and 3.75 in. deep; sides and floor were wood, 
and the roof was hardware cloth. The mazes 
differed with respect to length of sections, 
style of painting, and floor covering. Details 
are given in the theses by Leventhal (1955), 
Morgan (1957), and Morrell (1957). 

Lighting was provided by a single bulb 
above the median plane of each maze. A 
one-way vision screen was placed between E 
and the maze. 

Procedure.—There were features common 
to the treatment of all groups, namely: 
(a) every S received a reward on 100% of his 
runs to one arm of the maze and twice as 
large a reward on 50% of his runs to the other 
arm; (b) rewarded and nonrewarded runs 
to the 50% side were irregular in sequence 
within the limitation that each day (or when 
Ss received only two trials a day, within 
successive blocks of four trials) half the runs 
to the 50% side were rewarded; (c) after S 
had run to one side on haif its trials for that 
day, the door on that side of the choice point 
was closed forcing S to the opposite side 
(thus, the number of runs to each side was 
equal at the end of each day's training); 
(d) the 100%, 1 unit of reward condition was 
on the right for just half the Ss in each group, 
to counterbalance for possible position or color 
preference; (e) within each group, time in the 
goal box was the same on every trial. Table 
1 summarizes the differences between the 
groups with respect to the major variable, 
reward condition. 

At least eight days prior to the beginning 
of training the groups run to food reward 
were placed on a schedule of limited feedings 
every 24 hr. and reduced to 80% ad libitum 
body weight. This regimen was maintained 
throughout the experiment. Food rewards 
were pellets of S’s standard diet—Purina 
Laboratory Chow. The water deprivation 
groups had access to water only on rewarded 
trials and during the first hour after the 
completion of day's The 


each training. 
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Fic. 1. Learning curves for the four 
groups. Percentages of the first half of the 
day's runs which were to the 50%, 2-unit 
side are plotted for each block of eight trials 
(including all runs) for each group. 


intertrial interval was not controlled pre- 
cisely but ran between 5 and 25 min. Its 
average duration probably was longest for 
Group FL and shortest for Group FS. 

Six of the Ss in Group FL were held in a 
delay box for 30 sec. between the choice 
response and entrance to the goal box on 
every trial. Since the choices of these Ss 
did not differ systematically from those of 
the other Ss in this group, the delay was 
assumed to have no effect on the measure 
obtained and results of delay and nondelay 
Ss in Group FL were pooled. 

At all times water was available to Ss in 
Groups FS and FL, except that Ss in Group 
FS did not have access to water when they 
were in the maze. Dry food was available 
to Ss in Groups WL and WS except when 
they were in the maze. 

As indicated in Table 1, the total number 
of trials was 160 and 144 for Groups FS and 
FL, respectively, while the water groups 
were only run for 112 trials. It was originally 
planned to run them longer, but training was 
discontinued because a bitter substance had 
inadvertently been placed in the water given 
to some Ss on several trials starting with 
Trial 113. 


RESULTS AND DISCUSSION 


the data 


Throughout, 
are for the first half of the runs on 


presented 


each day. Data based on all free 
trials lead to the same conclusions 
and thus are not presented. ; 
Learning curves for each group 
are presented in Fig. 1 where per- 


centage of the first half of the day's 
runs to the 50%, 2-unit side are 
plotted as a function of blocks of 8 
trials (including both free and forced 
runs). Examination of Fig. 1 shows 
that the two small-reward groups 
acquired a marked tendency to run 
to the 50%, 2-unit side and that the 
two large-reward groups developed 
no clear preference for either side. 

The differences between the learn- 
ing curves for the two small reward 
groups is of little importance as (a) 
several factors which might affect 
learning rate differed systematically 
between the two groups, and (b) the 
magnitude of food reward comparable 
to a given amount of water is not 
known. However, it is of interest 
that for both these groups the prefer- 
ence for the 50%, 2-unit side begins 
to show up only after considerable 
training. 

In Groups FS and WS the number 
of Ss running to the 50%, 2-unit side 
on a majority of their last 12 trials 
(including only the first half of the 
trials on each day) was 11 and 12, 
respectively. Expansion of the binom- 
ial indicates that these preferences 
are significant at beyond the 1% 
level. In each of Groups FS and 
WS, 11 of the 12 Ss ran to the 50%, 
2-unit side more times in their last 
12 trials than on their first 12, indicat- 
ing that the increase in runs to the 
50%, 2-unit side cannot be attributed 
entirely tochance. Comparable tests 
for the other two groups do not 
approach significance. Presumably, 
even if a larger number of Ss or more 
extensive training should disclose a 
significant preference for the 50%, 2- 
unit side with these particular magni- 
tudes, a preference for the 100%, 1- 
unit side would be obtained if abso- 
lute magnitude were large enough so 
that consumption of one unit would 
produce satiation. 
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As measured by Fisher's exact test 
(Fisher, 1946, p. 96) the proportion 
of Ss making a majority of their 
last 12 runs to the 50%, 2-unit side 
was reliably different for the two 
food groups (P < .03) and for the 
two water groups (P < .005). Thus, 
for both food and water reward there 
is an interaction between mean reward 
and the relative reinforcing properties 
of two units of reward for 50% of the 
responses vs. one unit for 100%. 

No other trends (e.g., preference 
for the 100%, 1-unit side on the part 
of Groups FS, FL, and WL early in 
training) are significant. 


The explanation of the preference for 
50%, 2 units over 100%, 1 unit when the 
unit is small cannot be specified with 
assurance. However, the explanation 
which seems most reasonable is that 
complex internal responses which pre- 
sumably accompany consumption of 
food and water cannot be entirely 
appropriate if the goal object is quite 
small. That is, only if the magnitude 
of reward given a rat is greater than 
.25 gm. of food would the concomitant 
responses associated with consumption 
and digestion of food closely approximate 
the optimum level. The observation 
that when they have free access to food 
and water, rats ordinarily consume 
moderately large quantitites whenever 
they consume any at all is in line with 
this interpretation. 

It appears probable that further 
related experimental work which is 
integrated by a concept such as mean 
reinforcement can serve as the basis 
for construction of equal unit scales of 
the magnitude of the reinforcing effect 
of various food or water rewards. Since 
magnitude of reward and reinforcement 
magnitude apparently are not equivalent, 
construction of equal unit scales of 
magnitude of reinforcement would be 
helpful, perhaps even necessary, for the 
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development of a. precise quantitative 
behavior theory. 


SUMMARY 


Four groups of rats were run in single 
choice mazes where they received one unit 
of reward for each run to one side and two 
units for 50% of the runs to the other. Num- 
ber of occurrences of each response was 
equated by forced trials. The groups dif- 
fered in reward condition with large and 
small reward groups for both food and water. 

The two small reward groups, for which 
.25 gm. of food or 2 drops of water was equal 
to one unit, acquired a significant preference 
for the response followed by the 50%, 2-unit 
reward condition. The two large reward 
groups did not acquire a preference for either 
response. 

The results indicate that response selection 
is not determined entirely by the mean reward 
following choice responses. 
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THE USE OF ESTABLISHED CONCEPTUAL 
MEDIATIONS! 
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University of California, Los Angeles 


The study of semantic generaliza- 
tion can be carried out in either one 
of two ways. Jeffrey and Kaplan 
(1957) have attacked the problem 
by building up the associations upon 
which the generalization is based 
in the laboratory through the use 
of nonsense syllables. Bugelski and 
Scharlock (1952) used the same 
method in the study of mediated 
association. Others (Foley and Cofer, 
1943; Razran, 1949) have studied 
semantic generalization in which the 
requisite associations were assumed 
to be already present in S by virtue 
of his previous experience. 

The same hypothesis is being tested 


in either case: that a generalizing 
or mediating response is elicited due 
to some previous associations on the 
part of S and that these produce 


facilitation in learning during the 
critical portion of the experimental 
situation. The two methods differ 
only in their technique. 

It is the purpose of this investiga- 
ion to combine the two techniques in 
a single experiment so that the rela- 
tive power of each may be evaluated. 


1 The present study is based on a disserta- 
tion submitted to the Graduate Division of 
the University of California at Los Angeles in 
partial fulfillment of the requirements for 
the degree of Doctor of Philosophy. The 
author is indebted to Wendell E. Jeffrey, 
Chairman of the Doctoral Committee, and 
to John P. Seward and Irving Maltzman for 
their guidance during the course of the 
research. 

2 Now at Litton Industries, Beverly Hills, 
California. 


METHOD 


Learning material——In an attempt to 
develop a set of norms of common English 
words to be used in the study of concept 
formation, Underwood and Richardson (1956) 
obtained the association responses of college 
sophomores to about 200 words. The 
normative group was trained to give only 
responses in terms of a sense quality. The 
learning materials for the present experiment 
was taken entirely from Underwood and 
Richardson's list of norms. These lists were 
constructed giving the percentage of a 
particular response given to a word by the 
group as a whole, and if two words elicited the 
same response in a high percentage of the 
cases, they are considered by the writer to 
have preformed associations on the basis of 
the common response word. 

For example, the word ROUND was given 
as a response to the word WHEEL in 94% of 
the cases and was given as a response to the 
word GLOBE in 95% of the cases. The pair 
WHEEL-GLOBE then, is called a “‘media- 
tion pair’ in the context of this experiment, 
by virtue of the fact that the same response 
word, i.e, ROUND, was elicited by both 
members of the pair in a great proportion 
of the cases. No indication can be gotten 
from the lists, however, as to whether WHEEL 
and GLOBE are directly associated, and if 
so, to what extent this association exists. 
Pairs of this type are considered to have 
preformed associations, as opposed to post- 
formed associations which are induced in the 
experimental situation as described later. 
The words pre- and post-formed are referenced 
to the point in time at which the experiment 
begins. 

The pairs of words which appear in the 
test list are divided into two classes defined 
as mediation and non-mediation pairs on the 
basis of the value of the association as 
measured by the Underwood and Richardson 
lists. These classes were set up using two 
criteria. First, the mediation pairs are 
composed of words which had elicited a 
common response in more than 80% of the 
Ss. Second, the non-mediation pairs are 
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SEMANTIC GENERALIZATION 


TABLE 1 


| 
Treatment 


PARADIGMS FOR LEARNING 


List 3 





E, B-A 
(WHEEL—ROUND) 


E2 


B-a 
(IVY—BATON) 


b-a 
(COAL—BUCKLE) 








B-d 
(MILK—CORN) 


b-d 
(ANCHOR—COFFEE) 


composed of words which had not elicited 
a common response in more than 10% of the 
Ss. The further restriction was added that 
none of the words in the non-mediation pairs 
could have elicited in more than 10% of the 
Ss the response which is used to define any 
one of the mediation pairs. 

Experimental design.— The experiment con- 
sists of five treatments placed in what 
Lindquist (1953) calls a mixed design. This 
design permitted the use of each S as his own 
control in that he receives all of the treatments 
in the experiment. Thus, the treatment 
effects are within-S effects. The between-S 
effects will be discussed as the design is 
developed. 

The treatments are concerned with dif- 
ferent aspects of the semantic generalization 
problem and consist of three experimental 
and two control groups as follows: E,- 
Mediation pairs with the actual preformed 
association used in the training series; 
E,—Mediation pairs with a common response 
in the training series but not the actual pre- 
formed association; E;—Non-mediation pairs 
with a common response used in the training 
series; C;— Mediation pairs with no common 
response in the training series; C:—Non- 
mediation pairs with no common response in 
the training series, i.e., no possibility for 
semantic generalization to occur. 

The paradigm for the learning of the three 
lists is given in Table 1 for each treatment. 
The following notation is used in the table: 
B and C each stand for one member of a 
mediation pair; b and ¢ each stand for one 
word of a non-mediation pair; A is the actual 
response word used in determining a media- 
tion pair; a@ is a common response between 
a pair; and d and ¢ are different responses to a 


C-A 
(GLOBE—ROUND) 
C-a 
(SPINACH—BATON) 
c-a 
(HALLWAY—BUCKLE) 
C-e 
(CHALK—CAVE) 


e 
| (GINGER—ICICLE) 


B-C 
(WHEEL—GLOBE) 
B-C 
(IVY—SPINACH) 
b-c 
(COAL—HALLWAY) 


B-C 
(MILK—CHALK) 





b-c 
| (ANCHOR—GINGER) 


pair. An example of each of the pairs 
is presented under each of the symbolic 
representations. 

Each list is composed of three words for 
each of the treatment groups so that each S 
learned a 15-pair list composed of all the 
treatments. In order to counteract the 
confounding of the treatment effects with 
the difficulty of the words it would be neces- 
sary to rotate the words so that all of them 
appeared in each group. Because of the 
insufficient length of the Underwood and 
Richardson list, this was not entirely possible, 
but an approximation to it was made. The 
mediation pair treatments, E,, Es, and C,, 
were arranged in three groups so that each 
pair appeared in each of the treatments. For 
each of these three alternations the two 
treatment effects involving non-mediation 
pairs were also switched around, making a 
total of six groups. 

Six Ss were used in each of the groups in 
the experiment, making a total of 36 for the 
whole experiment. These were further di- 
vided into two groups, men and women, 
adding one more variable to the experimental 
design. 

Apparatus and procedure.—The paired- 
associate words used as the learning material 
for the experiment were typed on adding- 
machine tapes. The stimulus word alone 
was presented first, and then the stimulus 
word appeared again along with the response 
word with which it was to be associated. 
Three different random orders were used. 
A Gerbands M-1 memory drum, modified 
to accept a list of this size, was used. The 
drum revolved one space every 3 sec., and 
one blank space occurred between the end of 
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one presentation of the complete list and the 
beginning of the next. 

The Ss were drawn from the students 
enrolled in an introductory psychology course. 
Each S appeared for the experiment in one 
2-hr. session during which he learned each 
of the three lists to the criterion of three 
perfect repetitions plus two additional 
repetitions whether perfect or not. 

The S was seated before a cardboard 
screen which separated him from E and the 
major part of the apparatus. The face of the 
memory drum projected through a 3 X 5 in. 
hole in the center of the screen about 3 in. 
below the eye level of the seated S. 

After instructions for the experiment were 
read to him, S was allowed to ask any ques- 
tions regarding the procedure, and E made 
certain that S understood what he was to do. 
After the first list was started, no questions 
regarding the experiment were answered, 
and E did not communicate with S other 
than to give the brief instructions regarding 
the rest periods and subsequent lists. 

After the completion of the final list S 
was interrogated in an informal and un- 
structured manner allowing him to provide 
all the information he could without any 
direct questions from E. The purpose of 
this was to determine the extent to which the 
Ss realized the purpose of the experiment and 


could document this with specific recollection 


of the actual learning material. Because of 
the possibility of communication to future 
Ss, the full purpose of the experiment was 
not revealed to any S. 

For each S a record was made of the per- 
formance at each trial, a trial being the 
possibility of making an anticipation. The 
S's performance was recorded as correct, 
no response, or the word said if an incorrect 
word were given in anticipation. 


RESULTS AND DISCUSSION 


The data were collected in terms 
of the number of errors on the third 
or test list, an error being defined as 
either no response or an incorrect 
anticipation to the stimulus word 
when it appeared in the window of 
the drum. 

The means and SD's of errors for 
all S’s in each of the treatment groups 
are presented in Table 2. 

The design of the experiment calls 
for the data to be analyzed in terms 
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TABLE 2 
MEANS AND SDs or Errors sy ALL Ss 


IN TREATMENT EFFECTS ON THE 
Test List 





Treatment 














of an analysis of variance. This 
Treatments X Subjects design assumes 
that the interaction effects are dis- 
tributed with homogeneous variance 
among the treatments. A Bartlett 
test for homogeneity of variance 
(Edwards, 1950) was run by comput- 
ing an interaction effect for each score 
in the table and testing the five 
treatment variances of these inter- 
actions for homogeneity. The result- 
ing x’ was 23.6 (P < .001, 4 df), 
indicating heterogeneity of variance. 
Inspection of Table 2 shows that 
one of the variances, namely that 
of the E; treatment, differs markedly 
from the others. Both logarithmic 
and square-root transformations of 
the data failed to yield homogeneity 
of variance. On the basis of this 
result it was decided to drop the E, 
treatment from the analysis. When 
this was done, the Bartlett test x? was 
6.6, which for 3 df is nonsignificant. 
The analysis of variance was per- 
formed after exclusion of the KE, 
treatment, and is presented in Table 3. 

The proper error term for tests of 
the “within” effects is the MSrys. 
When these tests are made, the only 
significant effect is seen to be the 
Treatments main effect, and _this,: 
highly so. Since none of the other 
interactions is significant they may 
be pooled with the T X S interaction 
to give a better error term with addi- 
tional 33 df. This was not considered 
necessary, however, owing to the 
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TABLE 3 


ANALYSIS OF VARIANCE oF Test List Data 








Source MS 


Within Ss (201.45) 
Treatments 
TXG 
Txs 
T X Sex 

T XG X Sex 

Between Ss 








Total 











*P < 001. 


already highly significant. F ratio. 
The proper error term to be used in 
testing the ‘‘between”’ effects is the S 
main effect, and it is readily seen that 
none of these is significant. 

The critical difference between 
means for P = .05 was calculated to 
be 1.54. Reference to Table 2 will 
indicate those treatments which differ 
significantly. 

From the differences between the 
treatment means a hierarchical order 
from greater to lesser generalization 
can be formed among the four treat- 
ments. This order is E, — C; — E; 
— Cs, with the two differences at either 
end of the series, Ex, — C; and E; — Cz 
being the only nonsignificant ones. 
The questioning of Ss at the end of 
the experiment yielded the informa- 
tion that the recall of the pairs 
followed the same hierarchical order 
as the generalization as derived from 
the differences between the treatment 
means. 

Since the E; treatment was not 
included in the analysis it must be 
interpreted separately from the rest 
of the data. Inspection of Table 2 
shows that E,; was much easier than 
any of the others. This is further 
emphasized by the fact that the F 
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ratio between E; and E2, the compu- 
tation of which does not assume 
homogeneity of variance, turns out 
to be 15.80. This is highly significant. 
Twenty-eight out of the 36 Ss made 
no errors after the first presentation 
of the test list, whereas only 10 got 
perfect scores on the E, part. This 
result is to be expected since the 
basis for generalization between the 
pairs in this treatment comes not only 
from the preformed association but 
also from the reinforcement of the 
postformed association during the 
learning by the presence of the medi- 
ating response in the training list. 


What effect the presence of the pre- 
formed associations might have had on 
the learning of the other words in the 
list cannot be determined from the 
data. It may be that the presence of the 
common association between the stimu- 
lus and the response in the E; part of the 
list led S to look for similar connections 
and thus produced an added “‘instruc- 
tion’’ to him not intended by E. A 
suggestion offered to counteract this in 
further research of this type is to elimi- 
nate the E, group from the list under 
the assumption that this type of learning 
is different from, or at least of a different 
order of magnitude from, the others and 
cannot be studied in the same situation. 

The necessity for performing an analy- 
sis of covariance on the data might have 
arisen due to the differential effect of 
reinforcement in the different treat- 
ments. The Pearson r between the 
number of errors on the two training 
lists and the treatment was .59. When, 
however, the same coefficient was com- 
puted with the E, treatment omitted, r 
was only .29. This indicated that most 
of the differential reinforcement occurred 
in the E, treatment, and that with the 
omission of the E,; treatment, the 
analysis of covariance is no longer 
required. 

It may be seen from Table 2 that the 
lowest two groups with respect to learn- 
ing were the ones involving the non- 
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mediation pairs. The words in these 
two groups never appeared in the part 
of the list involving mediation pairs, 
and it might be said that the differences 
here are due to differences in the ease 
of learning the particular words rather 
than to the treatment effects themselves. 
This confounding was unavoidable be- 
cause of the inadequate tength of the 
norms for constructing the lists for this 
experiment. Two factors, however, ar- 
gue against this interpretation. The 
first is the insignificant group main effect 
and T X G interaction in the analysis, 
which shows that in whatever arrange- 
ment the words occur, there is no dif- 
ference. The second comes from an 
estimation of the relative familiarity 
of the Ss with the words taken from 
the norms constructed by Thorndike 
and Lorge (1944). The use of the ‘‘T”’ 
scale shows that the mediation pairs 
had an average occurrence of 218 per 
million and the non-mediation pairs, 
207 per million. This difference does 
not seem sufficient to account for the 
highly significant differences between 
the treatment means. 

The most obvious result in the experi- 
ment is that generalization occurred 
only in the two parts of the test list 
involving mediation pairs, since only 
these two differed significantly from 
the control part. This is interpreted 
as indicating that preformed associa- 
tions are a more powerful factor in pro- 
ducing generalization than postformed 
associations. 

Although the difference between the 
E, and C,; treatments is considerably 
larger than that between the E; and C, 
treatments, it still is not significant. 
This emphasizes the fact that the post- 
formed associations produce no generali- 
zation effect in this particular situation. 
Not only do the postformed associations 
fail to show any generalization when they 
are acting alone, but also, they fail to 
increase the effect of the preformed 
associations when they are put in com- 
bination with these. Even though this 
difference between the E, and C, treat- 
ments is not significant, the fact that 
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it is not negative is of some importance. 
Jeffrey and Kaplan (1957) found that 
generalization and not interference could 
explain the results of their study, and 
the same interpretation is invoked to 
explain these results. Although the 
addition of the postformed associations 
does not augment the generalization 
effect in the mediation pairs, neither 
does it diminish this effect by interfering 
with the learning process. 

The comparison of the C; — C, differ- 
ence and the E; — C, difference shows 
that in the total situation the preformed 
association alone produces generalization 
whereas the postformed association alone 
does not. 


SUMMARY 


Thirty-six college students learned three 
lists of paired words to mastery. ‘The lists 
were divided into five treatment effects 
representing two control groups and three ex- 
perimental groups. The experimental groups 
offered the possibility for semantic generaliza- 
tion to occur by virtue of one of two processes 
or by a combination of both of these. An 
estimation of the preformed associations was 
taken from norms developed on college 
sophomores by Underwood and Richardson. 
The postformed associations were built up by 
associating each word of the final pair to be 
tested to a common response word. The 
measure of generalization in the experiment 
was the facilitation in the learning of the test 
list produced by either of the two processes. 
The measure of learning was the number of 
errors made by the subject in anticipating 
the correct response word. 

The results of the experiment showed the 
following: (1) Postformed associations alone 
did not produce generalization between words 
in the test list. (2) Preformed associations 
alone did produce generalization between 
words in the test list. (3) The combination 
of postformed association with preformed 
association did not produce an increment in 
generalization over and above the effect of 
preformed association alone. (4) The recall 
of the subjects with respect to pairs of words 
on the test list and the training which led up 
to them, followed exactly the same order as 
the ease of generalization. 

It was concluded from these results that 
preformed associations are more effective 
in producing generalization than postformed 
associations. It was concluded also that 
the effect of the postformed associations was 














not an inhibitory one, in that the presence 
of these with preformed associations did not 
make the combination harder to learn than 
preformed associations alone. 
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Shuford (1957) has shown that 
when Ss are presented with a 20 X 20 
matrix of elements, a proportion, I, 
of which are in one state (e.g., blue 
squares, vertical bars) and the re- 
maining proportion, 1 — II, are in 
another state (e.g., red squares, 
horizontal bars), Ss are remarkably 
accurate in estimating the proportion 
of either class of element. With 400 
elements presented for a 1-sec. inter- 
val, there is no possibility that Ss 
are counting. Since Ss estimate so 
accurately when the entire population 
of elements is present, the question 
arises as to whether Ss will be equally 
accurate in their estimates if they 
sample a population of elements 
sequentially and without replacement. 
Such a situation is essentially that 
used by Humphreys (1939) and 
others (Estes & Straughan, 1954; 


! This study was done in support of ARDC 
Project 7740 while both authors were at the 
Operator Laboratory, Air Force Personnel 
and ‘Training Research Center, Lackland 
Air Force Base, Texas. Permission is granted 
for reproduction, translation, publication, 
use and disposal in whole and in part by or 
for the United States Government. This 
study has not received the review and 
approval normally given to this Center's 
reports. The views, conclusions, and recom- 
mendations expressed or implied herein do not 
necessarily reflect the official views or policies 
of the United States Air Force or of the Air 
Force Personnel and Training Research 
Center. The authors wish to thank Earl 
F. Lingle for his assistance in running the Ss 
and analyzing the data. 


Grant, Hake, & MHornseth, 1951; 
Neimark, 1956) for probability learn- 
ing where Ss are asked to predict 
which one of two possible events will 
occur on the next trial. 

It is commonly found in these 
probability learning tasks that the 
final proportion of predictions of a 
given class of events tends to approx- 
imate the relative frequency of oc- 
currence of that class. This being 
the case, one would expect that 
direct estimates of relative frequency 
as a response measure should resemble 
proportion of predictions. However, 
on a given trial S either predicts the 
event or he does not; in order to 
transform predictions into estimates 
(or response probabilities) it is neces- 
sary either to average across a block 
of trials for each S, or to average 
across Ss on a given trial. In either 
case, the grain of the resulting scale 
is a function of the number of trials 
or Ss used and one encounters all the 
hazards of averaging data (Bakan, 
1954; Estes, 1956; Sidman, 1952). 

The purposes of this study were (a) 
to obtain direct estimates of event 
relative frequency, (b) to compare 
these estimates with relative fre- 
quency of prediction of the event, 
and (c) to determine how both 
response measures are affected by 
availability of information concerning 
the preceding trials. 
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PROBABILITY LEARNING 


METHOD 


Subjects.—The Ss were 72 Air Force 
recruits. All Ss had an Armed Forces 
Qualification Test score of 65 or above, which 
is roughly equivalent to a Stanford-Binet 
1Q of 108 or more (Gragg & Douglass, 1956). 
Their modal age was 17. 

Apparatus.—Three decks of one hundred 
3 X 5-in. index cards were stacked face down 
on a modified desk calendar stand. Each 
card had one of two letters printed upon it. 
A cover card atop each deck contained the 
two letters which would appear in that deck; 
the letter whose frequency was to be esti- 
mated was underlined. The letters of the 
alphabet were paired randomly without 
replacement. Three sequences of card orders 
were drawn randomly with II = .67; the 
actual relative frequencies of the underlined 
letter in the three decks were .67, .63, and 
.65. The within-deck sequences and order 
of decks were the same for all Ss. 

Procedure*—The S was seated at a desk 
containing the three-deck stack of cards, 
three mimeographed answer sheets, and a 
pencil. He was told by E that he had three 
well-shuffled decks of cards, each card of 
which contained one of the two letters on the 
cover card for that deck, and that his task 
was to determine what percentage of the 
cards in the deck contained the letter under- 
lined on the top card. At the beginning of 
each deck, S was required to write down an 
initial estimate of the percentage of cards 
in the deck which would contain the under- 
lined letter. On each trial for 100 trials 
per deck, S wrote in the first column of the 
answer sheet his prediction of the letter which 
would appear on the card to be turned up 
and, after turning the card, he copied in the 
second column the letter which appeared. 
Two groups of 18 Ss each, the PI and PC 
groups, filled in only these two columns of 
the answer sheet for each card. Two other 
groups of 18 Ss each, the PEI and PEC 
groups, were required in addition to write 
in a third column their estimate of the 
percentage of cards in the deck which would 
contain the underlined letter. In other 
words, Ss in these groups made a new estimate 
on each trial in addition to the initial esti- 
mate which was required of all Ss. After 
completing the first deck, all four groups of Ss 
went directly on to the second and third 
decks, each of which used a different pair of 
letters. 

2 The study presented is part of a larger 
experimental design which, because of a 
variety of misfortunes, was never completed. - 
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One-half of the Ss, the PC and PEC 
groups, each received a large index card with 
which to cover the preceding predictions and 
outcomes (and estimates in the case of the 
PEC group). The remaining groups, PI 
and PEI, used no cover cards and could 
refer back to preceding trials. Each of the 
four groups was run in two sessions of 9 Ss 
each; each session required about an hour. 


RESULTS AND DISCUSSION 


The mean relative frequency of 
prediction of E, (the more frequent, 
i.e., underlined, letter) on successive 
10-trial blocks is given in Fig. 1. 
Data are plotted separately for each 
of the four groups and three decks. 
The data differ from the results of 
most studies employing this type of 
procedure in two respects: (a) with 
the exception of the last deck the 
curves do not start at a chance level; 
(6) predictions for the PE groups do 
not level off at II as those for the P 
groups do. Analysis of variance of 
E, predictions on the last 30 trials of 
each deck shows that the frequency 
of E; prediction is significantly higher 
for the PE groups than for the P 
groups (F = 11.10, 1 and 68 df). 
Although there is a good deal of 
within-group variability, the mean 
relative frequency of E,; for the P 
groups approximates II whereas the 
entire distribution is shifted upward 
in the case of the PE groups with a 
mean exceeding II. Neither informa- 
tion, nor deck effects, nor any inter- 
action is significant at the .05 level. 
Evidently, the availability of informa- 
tion concerning preceding events has 
no effect on the final level of mean 
E, prediction. 

When mean estimates of E, (Fig. 2) 
are compared with proportion of E, 
predictions (Fig. 1), it is apparent 
that estimates also tend to start out 
at a higher than chance level. The 
final levels of E, estimates appear to 
approximate II. Comparisons of final 
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Proportion of E, predictions in successive 10-card blocks for each of the 


four groups on each of three decks. 


level of mean E, estimates with 
final level of E, predictions on the 
last 30 trials of each déck indicate 
that: (a) predictions are consistently 
higher than estimates for the PE 
groups (F = 6.87, 1 and 68 df); 


whereas, (b) predictions for the P 
groups are not significantly different 
from estimates for the PE groups 


(F = 2.08, 1 and 68 df). Although 
comparison among final levels for 
the PE groups suggests that predic- 
tions are higher and estimates are 
lower (i.e., more accurate) when in- 
formation concerning the past events 
is available, neither information effects 
nor interaction effects is significant 
at the .05 level. 


Comparison of successive E,; predic- 
tions for the P groups with successive E, 
estimates for the PE groups (Fig. 2) 
supports the suggestion from compari- 
sons of final levels that predictions and 
estimates are comparable response meas- 
ures in this situation. Data on groups 
giving estimates only (EI and EC) 
would, of course, provide a stronger basis 
for this conclusion and it is unfortunate 
that these data are not available. How- 
ever, data on EI groups from similar ex- 
periments show that Ss follow the 
sequence quite closely and suggest that 
data for an EI group would differ from 
that for the PEI group only in decreased 
within-group variability. Although es- 
timates and predictions appear to be 
comparable measures it is clear that Ss 
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Comparison of proportion of E, estimates for the P groups with mean E, estimates of 
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Proportion of E; predictions on successive 10-card blocks in relation to actual 
Data are based upon an independent group of 198 Ss; actual E, proportion 


represents the mean of nine random sequences with Il = .67. 


in the usual probability learning situation 
are not all estimating the relative fre- 
quency of E, before making predictions, 
for when Ss are required to estimate as 
well as predict E,, the frequency of 
prediction is significantly higher than 
when estimates are not required. 

Initial E, estimates and predictions.— 
It will be remembered that both esti- 
mates and predictions of E, started at an 
above-chance level. In order to deter- 


mine whether probability learning curves 


starting from a chance level could be 
obtained in this situation, 198 Ss were 
run under a modified PC procedure in 
which initial estimates were not required. 
Nine random sequences of 300 cards were 
drawn with II = .67. Twenty-two Ss 
were run on each deck. For eleven of 
these Ss the letter which occurred with 
probability .67, E,, was underlined on 
the top card; for the other eleven Ss the 
letter which occurred with probability 
.33, Es, was underlined. Proportion of 
E, prediction was tabulated for all Ss. 
The results are plotted in Fig. 3 along 
with the actual proportion of E,’s in the 
nine random sequences (broken line). 
The curve starts at a chance level and 
shows a negatively accelerated increase 
to an asymptote about equal to .67. 
Clearly the traditional probability learn- 
ing curves can be obtained in this experi- 
mental situation. 

Our procedure in the main experiment 
differed from the traditional procedure in 
that all Ss were required to give an 
initial estimate of E, frequency at the 
start of each deck. We had assumed 


that the distribution of initial estimates 
(Bayes distribution’) would be sym- 
metrical about a mean of .50, but the 
obtained distributions were negatively 
skewed with means of .57, .63, and .60 
for the three decks. Thus, it would 
appear that underlining one of two 
letters and requiring S to estimate its 
relative frequency at the start of the 
experiment tends to increase his initial 
estimate. Moreover, it has already been 
shown that requiring S to estimate E, 
throughout the deck raises the level of 
concomitant E, prediction. Hence, the 
initial estimate alone should raise the 
initial level of E,; prediction. When no 
additional estimates are required, the 
effect of the initial estimate should decay 
on successive trials so that the final level 
of E; prediction is not affected. This 
effect is borne out in the data for the PI 
and PC groups. 


SUMMARY 


A factorial design was used in a probability 
learning situation to investigate the effects 
of (a) response required and (b) information 
available. Four groups of 18 Ss each went 
through three decks of 100 cards, each card of 
which contained one of two letters. The 
three 100-card sequences were randomly 
drawn with a probability of .67 for one of the 
letters and .33 for the other. Two groups 


‘Investigation of the effect of manipula- 
tion of initial Bayes distribution has been 
undertaken by Shuford and Land, who have 
developed a model for the prediction of 
successive estimates in this situation (Shuford, 


1957). 
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of Ss were required only to predict which 
letter would appear on each card; the other 
two groups were required to predict 
similarly and also on each trial to estimate 
the proportion of cards in the deck containing 
one of the two letters. For half the Ss only 
the prediction and outcome of the current trial 
was visible; for the other half the record of 
preceding trials was visible. 

Prediction proportions on the last 30 cards 
of each deck approximated .67 for the groups 
which predicted only, and was significantly 
higher for the groups which predicted and 
estimated. Estimates for the latter two 
groups were similar to the proportions of 
predictions of the prediction-only groups. 
Deviations from theoretical values for both 
initial and final levels of prediction were 
discussed in terms of the effects of estimates 
upon predictions. Availability of informa- 
tion had no significant effect upon predic- 
tions or estimates. 
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SOME CONDITIONS GOVERNING THE USE OF THE 
CUE-PRODUCING. RESPONSE AS AN 
EXPLANATORY DEVICE! 

JOSEPH pe RIVERA 
U. S. Naval School of Aviation Medicine 


Two similar stimuli are made 
more distinguishable if a response is 
learned to one stimulus and a different 
response is learned to the other stimu- 
lus. It has been suggested (James, 
1950; Miller & Dollard, 1941) that 
the different responses add different 
stimuli to the original stimuli thus 
making the original stimuli more 
disparate. This concept of cue-pro- 
ducing responses leading to an ac- 
quired distinctiveness of the stimuli 
to which they are attached has been 
fruitfully used as an explanatory 


concept by Dollard and Miller (1950). 
While in general the above concept 
has been useful, in work with rats by 


Lawrence (1949), with children by 
Cantor (1955), and with adults by 
Gagne and Baker (1950), some recent 
evidence suggests that the manner in 
which the concept is applied must 
be defined more closely: 


Robinson (1955) had one group of Ss 
learn a different name to each of 10 different 
fingerprints, and another group of Ss learn 
one name to five of the prints and a second 
name to the other five prints. Assuming 
that each of these responses furnished S 
with additional cues, we might expect the 
first group to have acquired a distinctiveness 
of the fingerprints while the second group 
acquired a similarity of the fingerprints. 
However, when Ss were given a task involving 
print discrimination, both groups performed 
equally well. 

Goss (1953) had some Ss learn names to 
four different light brightnesses while a 
control group was simply told to pay atten- 


! Opinions or conclusions contained in this 
report are those of the author. They are 
not to be construed as necessarily reflecting 
the view or the endorsement of the Navy 
Department. 


tion to the lights. After this training, Ss 
learned to select a different switch to each 
light brightness. Only those Ss with 100% 
overlearning of the names performed better 
than the control group on the second task. 

In a recent review of the literature on 
stimulus predifferentiation, Arnoult (1957) 
concluded from the available data that: 
“Positive transfer from stimulus predifferentia- 
tion training may be expected after a minimum 
of 4 to 8 pretraining trials and reaches a maxi- 
mum after 8 to 12 pretraining trials” (p. 341). 
Arnoult further concluded that a satisfactory 
explanation for the positive transfer obtained 
from predifferentiation training would be 
developed only when the physical and experi- 
mental aspects of the stimulus could be better 
described. 

A preliminary experiment by the present 
author suggested one way in which the 
conception of the stimulus should be refined 
in order to apply the concept of the cue- 
producing response to successfully explain 
the above-mentioned data. The Ss first 
learned letters, then numbers, to a set of 
fingerprints. Paralleling Robinson's results, 
those Ss who learned two letters to 10 prints 
showed as much transfer as those who learned 
10 letters. However, many Ss who had 
learned 10 letters stated they had tried to 
forget the letters and memorize the numbers 
directly to the prints. Furthermore, when 
Ss who had learned two letters indicated 
the characteristics of the prints to which 
they had memorized these two letters, it 
became apparent that most of them had 
actually learned 10 responses. That is, 
they had singled out 10 different character- 
istics, one from each print, and had memorized 
a letter to each of these characteristics. 
Instead of learning that all five prints called 
by the same letter showed, perhaps, pro- 
nounced loops, S would learn that one print 
had a whorl in the lower left corner, another 
had a slash in the middle, and so on. 


The above experiments led to two 
hypotheses as to the manner in which 
the concept of cue-producing re- 
sponses should be applied. Let overt 
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responses be defined as those responses 
which E requires S to give when 
different stimuli are presented, and 
let perceptual responses be defined as 
those responses which constitute S’s 
representation of the stimulus. In 
the following experiment the percep- 
tual responses are inferred to be the 
selective seeing of certain character- 
istics of the fingerprint stimuli, and 
are represented verbally by such 
statements as ‘‘a whorl in the center,” 
“a very dark print,” or “wavey 
lines.” It is then hypothesized that: 
(a) overt responses learned to the 
stimuli will not be used to furnish 
cues for additional learning unless 
these overt responses are strongly 
attached (overlearned) to the stimuli; 
and (b) in cases where the overt 
responses are not overlearned and 
hence not used to furnish cues, 
perceptual responses learned to the 
stimuli will be used as cues for further 
learning, and hence, the number and 


kind of these perceptual responses 
will be more important than the 
number and kind of overt responses. 

To test the first hypothesis Ss were 
given either 8 or 16 trials to learn to 
write 10 letters in response to 10 


fingerprints. Then Ss were given 8 
trials to learn 10 numbers to the 
same print. The Ss who learned 
letters for 16 trials should more 
frequently use these letters as cues 
for learning numbers to the prints. 
Having these additional cues avail- 
able, they should show more positive 
transfer. 

To test the second hypothesis, Ss 
were given 8 trials to learn two 
letters to the 10 prints. Some Ss 
were told to learn each letter to the 
characteristics of each individual print. 
Their perceptual responses should be 
similar to those of the Ss who learned 
10 letters to the individual prints. 
Another group of Ss was told to 
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memorize each of the two letters to 
the characteristics which were pos- 
sessed in common by the prints with 
the same letter. Their perceptual 
responses should be different from 
those of the other Ss. The second 
hypothesis predicts that, although 
only two overt responses were learned, 
those Ss who learned 10 perceptual 
responses should perform as well on 
the transfer task as Ss who learned 
(but did not overlearn) 10 overt 
responses. On the other hand, those 
Ss who learned only two perceptual 
responses should not have as many 
cues available and hence should not 
transfer as well. 


METHOD 


Subjects —Groups of naval aviation cadets 
were tested during their first week of pre- 
flight training at Pensacola, Florida. The 
size of the groups varied from 19 to 36 men. 
The Ss were told that the test was for research 
purposes only. 

Procedure.—A set of 10 different finger- 
prints reproduced on slides served as stimuli 
for all the groups tested. On the first trial 
a slide was projected for 1 sec., then its correct 
response (a letter or number) was announced 
and the slide remained on the screen for two 
more seconds. Immediately afterwards, the 
next slide of the set was shown. After this 
introductory trial, Ss were given 8 (or 16) 
training trials. On these trials a slide was 
shown for 4 sec., during which time Ss 
were required to guess and write down the 
slide’s correct number. After this time the 
correct number was announced; in the next 
2 sec. S marked his guess right or wrong, and 
had a chance to associate the correct number 
with the slide, then the slide was removed. 
(In order to detect any tendency for S to 
write the numbers immediately after they 
were announced, on Trials 4 and 8 the num- 
bers were not announced until all 10 slides 
had been shown.) Four seconds after a 
slide was removed, a new slide was presented 
until all 10 fingerprints had been shown. 
The Ss then counted their correct guesses 
and wrote down the sum. After 30 sec. 
the same 10 slides, mixed up in a different 
order, were again presented until Ss had 
completed 8 (or 16) trials. The order in 
which the slides were presented was initially 
determined randomly. 
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Initial training.—Group A was given 8 
trials to learn 10 letters (A—J) to the set of 10 
fingerprints. On Trial 8 an average of 7.8 
letters reported correctly indicated a sub- 
stantial amount of learning had occurred. 

Group B was given 16 trials to learn 10 
letters (A—J) to the set of 10 prints. 

Group C was given 8 trials to learn two 
letters to the set of 10 prints. Half of the 
prints were arbitrarily called “C” and half 
were called “G.”" The Ss in Group C were told 
“. . . there is nothing in common between 
the five slides called ‘G.’ I just arbitrarily 
picked five of them and called them ‘G.’ 
Thus, when you memorize the slides don't 
waste your time looking for common char- 
acteristics—you'll just get mixed up. In- 
stead, start right away to memorize to each 
individual slide its correct letter.” 

Group D was given 8 trials to learn two 
letters to the set of 10 prints. The same 
prints which had been arbitrarily called ‘“‘C” 
and “G"’ for Group C were again called “C” 
and “G."" However, Ss in Group D were 
told, “. . . five of these slides have certain 
aspects in common and are called ‘C,’ the 
other five slides also have certain character- 
istics in common and are called ‘G.’ It is 
your job to learn to call each slide by its 
correct letter. Obviously, the best way to do 
this is to spot the common characteristics 
of the ‘C’ slides and the ‘G’ slides.” 

Group E received no initial training. 

Transfer training.—Approximately 5 min. 
after the initial training, all five groups 
learned a new set of responses, the numbers 
1-10, to the same set of fingerprints used in 
the preliminary training. Each S received 
eight trials on this new task. After these 
trials Ss were asked to give the character- 
istics of the slides which they had used in 
memorizing the responses. 


RESULTS 


The mean num- 


Initial training. 
bers of errors made by each group in 
the first eight training trials are as 


follows: Group A—27.6 (SD == 13.0); 
Group B—30.2 (SD = 14.3); Group 
C—19.5 (SD = 8.3); Group D—18.7 
(SD = 8.7). 

As was expected, the means and 
SD’s of Groups A and B (which had 
to give 10 responses) are much larger 
than the means and SD’s of Groups 
C and D (which had to give only two 
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responses). As is proper, however, 
the mean and SD of Group A are not 
significantly different from those of 
Group B; and the mean and SD of 
Group C are not significantly different 
from those of Group D. 

Stimulus characteristics important 
in transfer training.—After the eight 
transfer learning trials, Ss in Groups 
C and D were asked to write or draw 
the characteristics of the different 
fingerprints to which they had memo- 
rized the responses ‘“‘C"’ and “G.” 
Analysis of these characteristics showed 
that 29 of the 36 Ss in Group C gave 
a different characteristic for each 
slide. As instructed, most of Group 
C had memorized the two responses to 
10 different slide characteristics. In 
Group D, 26 of the 30 Ss gave char- 
acteristics which were common to 
the “C”’ slides or to the “G”’ slides. 
For example, one S selected the fact 
that “almost all C’s were lighter,” 
and another remarked, “most C’s 
had narrow loops.”’ As instructed, 
most Ss in Group D had memorized 
the two responses to characteristics 
common to the slides with the same 
letter. 

The Ss in Groups A and B were 
asked to indicate for each number 
(1-10) whether they had memorized 
the number to the previously learned 
letter or directly to some character- 
istic of the print. If to the letter, 
they were asked to write the letter 
down beside the number. A fre- 
quency distribution shows that Ss 
in Group A used a letter to mediate 
39% of their transfer trials. The 
Ss in Group B used a letter to mediate 
60% of their transfer trials. 

Since the distributions did not 
approximate normality, a Mann-Whit- 
ney U test was run. The differences 
are significant at the .05 level and 
part of Hypothesis 1 is thus con- 
firmed. Many Ss in Group B made 
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TABLE 1 





Mean 





P (One-Tail) 





16.63 
12.35 
16.39 
22.13 
28.26 











-04 
.04 
.02 














statements such as, “At first I got 
the number from the letter; gradu- 
ally I grew to recognize the number 
from the print."’ Several Ss indicated 
the mnemonic device they used to 
learn the number to the letter such 
as, ‘‘the word 60 for six-d.”’ 

Transfer training.—The mean num- 
ber of correct responses made on each 
trial by the various groups during 
transfer training is shown in Fig. 1 
and the total number of errors made 
by each group is shown in Table 1. 
As predicted by Hypothesis 1, Group 
B made fewer errors on the transfer 
task than did Group A. The differ- 
ence is significant at the .04 level 
(one-tail). 

In order to establish that the superiority 
of Ss in Group B was due to their greater 
utilization of the letters as cues, Ss in both 
Groups A and B were divided at the median 
into those who reported using many and 
those who reported using few letters, and 


into those who made many and those who 
made few errors, and a chi-square test was 
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Fic. 1. 


Mean number of correct responses on 
each trial of transfer training. 


run. Those Ss who more frequently used 
the letters as cues tended to make less ertors 
on the transfer task (P = .04, one-tailed). 
However, inspection of the data showed that 
Ss who used the letters tended to have made 
fewer errors when they initially learned the 
letters to the prints. 

An analysis of covariance was used in order 
to eliminate the effects of this ability differ- 
ence. The Ss were placed in a 2 X 2 table 
according to whether they had used more or 
less than the median number of letters, and 
according to whether they had received 8 
or 16 trials in which to learn the letters. 
Quenouille’s (1948) technique was used to 
correct for unequal N’s, and the sums of 
squares were adjusted to allow for differences 
in initial ability. The interaction term was 
tested by using the within-cells mean squares 
as anerror term. An F = .62 was obtained. 
Since the interaction was insignificant, the 
interaction and within-cells mean squares 
were pooled to test the row and column 
effects. The column (number of trials) 
effect was significant at the .03 level (one- 
tailed). The row (use of letter as cue) 
effect was insignificant (F = .68). Thus, 
when account is taken of the greater learning 
ability of those Ss who use letters as cues, the 
use of letters does not significantly improve 
S’s transfer learning. 


As predicted by Hypothesis 2, 
Groups A and C made approximately 
the same number of errors, while 
both groups made less errors than 


did Group D. The difference be- 
tween Groups A and C is negligible. 
The difference between Groups C and 
D is significant at the .04 level (one- 
tail). The difference between Groups 
A and D fails to reach the .05 level 
of significance when the unequal 
variances are equalized by the use of a 
Behrens-Fisher test. The evidence 
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cannot be called conclusive but clearly 
supports Hypothesis 2. 

The transfer task variances of 
Groups C, D, and E are approxi- 
mately equal to each other and to the 
initial task variances of Groups A and 
B. However, all five of these vari- 
ances are far greater than the transfer 
task variances of Groups A and B. 
The differences are all significant 
between the .05 and .01 level (two- 
tails). Evidently, practice in learn- 
ing 10 different overt responses to the 
prints significantly reduced the vari- 
ance of the distribution of errors 
made in the later learning of 10 
responses to the prints. Practice 
in learning two different overt re- 
sponses to the prints did not lead to 
this effect. 


DISCUSSION 


We tentatively conclude that the ex- 
tent to which overt responses, learned to 


stimuli in preliminary training, will be 
used to mediate the learning of new 
responses, is a function of how well the 
overt responses are learned to the initial 


stimuli. Thus, Group B used the re- 
sponses to a greater extent than Group A. 
However, Ss who reported using these 
mediating responses seemed to have no 
advantage. The superiority of Group B 
must be attributed to some other factor. 
Perhaps Ss in Group A incurred more 
negative transfer because they received 
an equal amount of training on both the 
initial and transfer tasks. Perhaps, even 
when an overt response was not reported 
as the primary cue, it still served to help 
differentiate the print by becoming part 
of a complex stimulus pattern, and thus 
the degree to which the response was 
learned was important. 

When an overt response is not well 
learned to a stimulus it will not function 
as a primary cue for later learning. 
However, the necessity for an overt 
response will cause S to learn several 
other responses incidentally. These 
other responses include adjustive re- 
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sponses, such as when and how to look 
for the stimulus and when and how to 
give the required response, and percep- 
tual responses such as ‘‘one slide has a 
loop in the center,” another has 
“straight lines on the bottom,"’ and so 
forth. The perceptual responses may 
themselves furnish cues for future learn- 
ing, even though the initial overt re- 
sponses which occasioned the perceptual 
responses to be made have been for- 
gotten. Therefore, in studying the ac- 
quired distinctiveness of cues we must be 
sophisticated as to just which responses 
are furnishing additional cues. In the 
experiment reported above, when the 
overt responses (letters) were not well 
attached to the stimuli they did not 
usually function as cues. However, the 
groups learning different perceptual re- 
sponses (Groups A and C) showed more 
transfer than the group learning similar 
perceptual responses (Group D). There- 
fore, the perceptual responses did func- 
tion as cues and their number and 
similarity were more important than the 
number and similarity of the overt 
responses. This interpretation also may 
account for the results of Robinson 
(1955). We need simply postulate that 
his group which learned two overt 
responses to fingerprints actually learned 
10 perceptual responses as well, as did 
groups in our preliminary studies. 

It is interesting to note that when we 
view the transfer resulting from stimulus 
predifferentiation in this second way, 
that is, as a function of the perceptual 
responses learned rather than the overt 
responses taught, we are in effect saying 
that the labels taught S are irrelevant 
and merely force him to search for 
discriminable aspects of the stimuli. 
This, however, is the “attention to cues” 
hypothesis mentioned by Arnoult (1957) 
and favored by Hake and Eriksen (1955) 
as an explanation for the positive transfer 
resulting from stimulus predifferentia- 
tion. We may, therefore, deduce this 
point of view from a consideration of the 
general cue-producing response. 

While we can say that perceptual cues 
probably played a role in the discrimin- 
ability of our fingerprints, we cannot say 
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whether the distinctive perceptual re- 
sponses aided discrimination or the 
common perceptual responses hindered 
discrimination or, as is most likely, both 
of these effects occurred. To state the 
direction of the effect another group 
would be necessary in order to control 
for the practice effects which were 
probably one reason the groups with 
preliminary training learned the numbers 
with greater ease than the group which 
had no preliminary training. In other 
words, we cannot say here how much of 
our effect is due to the adaptive responses 
we mentioned above. It should also be 


noted that Ss in Group D learned the 
general observing response of looking 
for common rather than discrete charac- 
teristics in the fingerprints; this habit 
might account for part of their failure to 
transfer as much as the other groups. 


SUMMARY 


Conflicting studies on the effect which 
teaching differential responses to cues has 
on the acquired distinctiveness of these cues 
led to the following two hypotheses: (a) 
Distinctive overt responses attached to 
stimuli will not be used to furnish additional 
cues for later learning unless the responses 
are strongly attached (overlearned) to the 
stimuli. (6) When overt responses are not 
strongly attached to the stimuli, the percep- 
tual responses made in the attempt to learn 
the overt responses furnish cues for later 
learning to the stimuli and hence are more 
important than the overt responses in deter- 
mining future transfer. 

To test these hypotheses one group of Ss 
(Group A) learned distinctive overt responses 
to a set of stimuli; Group B overlearned the 
same distinctive overt responses to the 
stimuli; Group C learned equivalent overt 
responses but distinctive perceptual responses 
to the stimuli; Group D learned equivalent 
overt responses and equivalent perceptual 
responses to the stimuli; and Group E learned 
no responses to the stimuli. Immediately 
thereafter all groups were required to learn 
a second set of responses to the same set of 
stimuli, 

All groups that received preliminary 
training did better than the group that 
received no training. As _ predicted by 
Hypothesis 1, the group that overlearned the 
initial responses used cues from the initial 
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responses to a greater extent than the group 
that did not overlearn the initial responses. 
The group with overlearning also demon- 
strated more transfer. However, this su- 
periority was apparently not due to the 
greater use of initial responses as cues. As 
predicted by Hypothesis 2, there was no 
significant difference between the group that 
learned distinctive overt responses and the 
group that learned equivalent overt responses 
but distinctive perceptual responses, yet both 
of these groups showed more transfer than 
the group that had learned equivalent overt 
and equivalent perceptual responses. 
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CHANGES IN PERFORMANCE AS A FUNCTION OF SHIFTS 
IN THE MAGNITUDE OF REINFORCEMENT ! 


GEORGE COLLIER anp MELVIN H. MARX 


University of Missouri 


Shifts in magnitude of reinforce- 
ment result in changes in performance. 
The course of the change and the 
final limit of change are two questions 
of major interest. 

Both overshooting of the expected 
asymptote of performance when the 
magnitude is increased and under- 
shooting when the magnitude is 
decreased have been reported (Crespi, 
1942, 1944; Spence, 1956; Zeaman, 
1949). Both gradual and abrupt 


changes in performance following a 
change in magnitude of reinforcement 
have been observed (Crespi, 1942, 
1944; Dufort & Kimble, 1956; Pere- 
boom, 1957a, 1957b; Spence, 1956). 
In these previous experiments on 


shifts in reinforcement, S’s experience 
with the first magnitude of rein- 
forcement was gained concurrently 
with the acquisition of the instru- 
mental response; the shift in rein- 
forcement occurred at some point 
after asymptote had presumably been 
reached (Spence, 1956). In such a 
procedure the acquisition of the 
instrumental response and the experi- 
ence with the reinforcement value 
are confounded. The present experi- 
ment examines the effect of shifts 
in magnitude of reinforcement when 
the experience of the first magnitude 
of reinforcement and the acquisition 


1 This investigation was supported by a 
research grant, M-817, from the National 
Institute of Mental Health, of the National 
Institutes of Health, Bethesda, Maryland. 
A brief report on the data was made at the 
meetings of the Midwestern Psychological 
Association, Detroit, 1958. The authors 
wish to thank Gopaul Sharma for his very 
careful collection of the data. 


of the instrumental 
independently. 


res} »ONnse Occur 
: 
METHOD 


Apparatus.—Four Skinner boxes deliver- 
ing liquid reinforcements were used. The 
magazines were 12-in. diameter aluminum 
(free-machining) plates, } in. thick, with 
seventy-two j-in. diameter cups drilled 
around the periphery. The maximum vol- 
ume of each cup was .4 ml. A second plate 
on top of the first covered the cups and 
prevented evaporation. The plate was ad- 
vanced by a Geneva movement driven by a 
synchronous motor. The operation of the 
magazine was barely audible. A Gerbrands 
bar activated by a force of approximately 22 g. 
was used. The magazine was located below 
the rat cage, which was 8 X 9} X 7-in. cage 
of the same sort used to house the animal. 
The whole assembly was located in a sound- 
proofed, light-tight, ventilated ice-box hull. 
The box was dimly illuminated. 

Subjects.—Sixty female albino rats of the 
Sprague-Dawley strain acquired from the 
Holtzman Company were placed on a 23-hr. 
privation routine a week prior to the begin- 
ning of the experiment. The hour of feeding 
followed the experimental session by 1 hr. 
Losses through experimental error, failure of 
Ss to begin taking all rewards within four 
days, sickness, and random discards to equal- 
ize the number in each group reduced the 
number of Ss to 45. 

Procedure.—Three groups of 15 Ss were 
magazine trained on 4%, 11.3%, and 32% 
sucrose solutions, respectively, for eight days 
(% conc. = we. eee *« 100) 
wt. solute + wt. solvent 
The magazine operated automatically on an 
aperiodic schedule with a geometric average 
of 1 min. The times between present&tions 
were 4, 1, 2, and 4 min. Twenty rewards 
were offered in the 20-min. session. All 
groups were then B-P (bar-press) trained 
on 11.3% sucrose solution, directly on a 1- 
min. periodic reinforcement schedule for ten 
days. The use of a test cage of the same 
kind as S’s home cage and of an aperiodic 
schedule during magazine training produces 
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Fic. 1. Mean number of bar presses per 
20-min. session for Ss bar-press trained on 
11.3% and magazine trained on 4%, 11.3%, 
or 32% sucrose solutions. 


very rapid conditioning of the B-P. Four 
days of extinction followed. 

Skinner boxes and positions in the home 
cage rack were counterbalanced over groups. 


RESULTS 


The number of reinforcements taken 
was determined for each rat during 
magazine training. An analysis of 
variance’ of these numbers showed 
no sigificant difference among groups, 
P> .O5 (F = 1.18; df = 2 and 42). 
The mean number of reinforcements 
taken was 151.1 out of the 160 
offered. 

The mean number of bar presses 
for each group for each session during 
acquisition and extinction is presented 
in Fig. 1. An analysis of variance 
for Groups, Trials, and Replications 
is presented in Table 1. The main 
effects of Groups and Trials were 


significant, while their interaction 


*The dropping of Ss obviously restricts 
the variance. The number dropped did not 
itself differ from group to group. It has 
been our experience that two to three contacts 
with the solutions is sufficient to lead to 
asymptotic respondings, whereas Ss that do 
not drink within the first two to three days 
almost never do. 
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TABLE 1 


ANALYSIS OF VARIANCE OF NUMBER OF 
Bark PRESSES 























Source df MS F 
Between Ss 44 
Groups (G) 2 | 88,543 | 3.27* 
Replications (R) 2 | 20,879 .80 
GXR 4 | 32,674 1.25 
Error, 36 | 26,195 
Within Ss 405 
Trials (T) 9 | 66,971 | 37.95** 
TxXG 18 1,183 67 
TXR 18 2,464 1.40 
TXGxXR 36 1,587 .90 
Errory 324 1,765 
Total 449 
*P< 0S. hin 
P< O1. 
wasnot. Replications and their inter- 


actions failed to reach significance. 
A test of the homogeneity of variance 
among groups failed to reach sig- 
nificance, .20 < P < .30 (x? = .379; 
df = 2). Individual ¢ tests between 
the 4% and 11.3% groups and be- 
tween 11.3% and 32% groups did 
not reach significance. The average 
difference between the 4% and 11.3% 
groups was slightly greater, 28.9, than 
that between the 11.3% and 32% 
groups, 18.5.4. The group trained on 
4% and tested on 11.3% was superior 
to the group trained on 11.3% and 
tested on 11.3%, while the group 
trained on 32% and tested on 11.3% 
was inferior. There was no trend 
over the 10 sessions which would 
indicate that these differences were 
diminishing. A similar analysis of 
variance on the extinction data showed 
the same results, the main effects of 
groups and trials being significant 
while their interaction was not. 


* An examination of these differences by a 
medians test yielded probabilities between 
the 5 and 10% level for both differences, 
indicating that the failure of the ¢ tests to 
reach significance is not due to skewness. 
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DISCUSSION 


The occurrence of abrupt changes in 
performance following shifts in the 
magnitude of reinforcement, as well as 
their bidirectional character, has led to 
the identification of the magnitude of 
reinforcement with motivational rather 
than habit factors (Dufort & Kimble, 
1956; Hull, 1951; Seward, 1956; Spence, 
1956). The failure of the limits of the 
course of change to conform with the 
asymptotic values obtained with Ss 
trained on the new magnitudes has led 
to the introduction of additional emo- 
tional factors, dramatically called elation 
and depression (Crespi, 1942, 1944) or to 
the assumption that the effective rein- 
forcement on any trial is determined by 
the previous history of magnitudes of 
reinforcement (Jones, 1952; Spence, 
1956). 

Three sorts of mechanisms have been 
invented to collect the properties of the 
magnitude of reinforcement: (a) emo- 
tions with drive properties resulting from 
the delayed gratification of the fractional 
consummatory response (Spence, 1956) 
or the frustration of expectations (Crespi, 
1942, 1944); (b) motivational properties 
associated in an unspecified fashion with 
the magnitude of the fractional goal 
response, r,, (Spence, 1956); and (c) 
variation of the effective reward as a 
function of expectation determined by 
the chain of preceding reinforcement 
magnitudes (Jones, 1952). In the first 
two mechanisms, which were introduced 
in part to account for abrupt shifts in 
performance via motivational changes, 
no provision is made to handle the effects 
of preceding reinforcement experience 
on present reinforcement effectiveness. 
Spence’s (1956) association of K (in- 
centive motivation) with the magnitude 
of r,, which is classically conditioned, 
implies that K should change as a func- 
tion of trials following a shift in rein- 
forcement magnitude. His interpreta- 
tions of his own data belie this implica- 
tion in that K was introduced to account 
for the abrupt shifts in performance. 
Spence notes in several places the 
occurrence of such abrupt shifts in his 
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and other data (e.g., Spence, 1956, pp. 
130-133). This difficulty with K was 
long ago pointed out by Leeper (1952). 
Jones (1952), in an attempt to deal with 
this problem, has modified Hull's postu- 
lates to substitute expectation for habit. 
His formulation is based on the assump- 
tion that the effective value of a reward 
is related to both the present magnitude 
and the preceding sequence of rewards. 
He has formalized this assumption in a 
way such that the other relations gen- 
erated by Hull’s sHpr and sEpr concepts 
remain unchanged. 

The results of the present experiment 
seem to offer difficulties for each of these 
mechanisms. Neither the fractional goal 
response nor emotional states produced 
by lack of immediate gratification can 
handle the obtained outcome. On the 
assumption that the magnitude of the r, 
determines K, one would predict that the 
32% group would have the r, of greatest 
rate of responding, while the 4% group 
should show the lowest.* Further, using 
the notion of the classically conditioned 
r,, one would expect all values to con- 
verge over trials to the level of perform- 
ance of the 11.3% group. Similarly, the 
present data rule out any interpretation 
in terms of motivation produced by 
frustration or delayed gratification of the 
r, since, again, both the direction and 
duration of the effect are wrong. An 
account in terms of elation or depression 
is not plausible since the duration of the 
effect is too long. It seems unlikely that 
one would be still celebrating or com- 
plaining 10 days and 200 reinforcements 
later. While Jones’ account takes care 
of the problem of the effect of preceding 
rewards, it does not suggest the variables 
which govern the magnitude of the shift 
effect. 

Thus it appears that the identification 


‘Preliminary results obtained in our 
laboratory indicate that this prediction is 
verified when a stimulus associated with 
different magnitudes of reinforcement for 
different groups during magazine training 
is used as a reinforcer. In this case, the 
greater the magnitude of reinforcement the 
greater the reinforcing power of the associated 
stimulus. 
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of magnitude of reinforcement directly 
with either the asymptote of habit 
strength or a motivational factor is in- 
correct. 

Recent studies of the amount of rein- 
forcement function have suggested that 
performance is a function of the stimulus 
dimensions of reinforcement (Collier, 
1958; Collier & Siskel, 1959). In this 
case it would be expected that reinforce- 
ment would have the properties of a 
sensory scale. One of the variables 
known to influence sensory magnitude is 
the sequence of preceding judgments, as 
demonstrated, for example, by the 
adaptation level or the anchoring effects 
observed in the absolute judgment ex- 
periments. Similarly, the exposures to 
the different concentrations during maga- 
zine training would be expected to lead 
to a different ‘‘sweetness” for the 11.3% 
concentration for the three groups of rats. 
Thus, one could argue that the rein- 
forcing effects of a given magnitude of 
reinforcement, that is, for example, its 
sweetness, saltiness, bitterness, etc., are 
a joint function of the classical psycho- 
physical variables, among which are the 
sequence of preceding stimulations. 
Thus, if one assumes that reinforcement 
has psychological dimensions, one can 
account for many of the effects which 
have been observed following shifts in 
magnitude of reinforcement, as well as 
the over- and undershooting of the 
expected asymptotes. This view would 
require, as do the Jones and Spence 
formulations, that the curves of the 
present experiment should eventually 
converge. It seems reasonable to as- 
sume that the effects of preceding stimu- 
lations diminish with successive stimula- 
tion. The present experiment shows 
that their effects are not transitory but 
leaves unanswered the questions as to 
their duration and to the variables which 
govern their duration. 

These comments obviously do not 
explain the shift effects, since the ‘‘judg- 
mental” effects are not themselves ex- 
plained. They merely amount to a 
reclassification of the class of events to 
which the shift effects belong. 

The present design avoids the problem 
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of the interaction of the effects of shifts 
in reinforcement with changes in the 
performance of the instrumental re- 
sponse. There are a number of pieces 
of data in the literature which suggest 
that this may not be a single problem. 
For example, in the studies so far re- 
ported, those which have used one trial 
per day have shown abrupt shifts (e.g., 
Crespi, 1942; Spence, 1956, pp. 130-131; 
Zeaman, 1949) while those studies using 
more than one trial per day have shown 
relatively more gradual shifts (Dufort & 
Kimble, 1956; Pereboom: 1957a, 1957b). 
There is a suggestion in some prelimi- 
nary data from our laboratory and some of 
Guttman’s (1953) data that the gradual- 
ness is even more pronounced if the 
shifts occur within a session, where the 
more transient residual effects of the 
preshift experience are still present. On 
the other hand, Pereboom (1957a) has 
made a case for the differential effects 
of competing responses (exploration of 
the runway in his case) for Ss trained 
under different magnitudes of reinforce- 
ment. Further, both Spence (1956) and 
Pereboom (1957a, 1957b) have pointed 
out roles which the asymptote of respond- 
ing can play in determining the sorts of 
effects which occur following shifts. 

This diversity of outcomes with respect 
to the course of change in performance 
following a shift in magnitude of rein- 
forcement suggests that the course of 
change may be influenced by a number 
of variables not associated with the 
magnitude of rewards. The “effective- 
ness” of a reinforcer is a joint function 
of the present value of the stimulus 
characteristics and previous contacts 
with these or other values. The char- 
acter of the course of shift is a function 
of the situation in which it is determined. 


SUMMARY 


Three groups of rats were magazine-trained 
on either 4%, 11.3%, or 32% concentrations 
of sucrose solutions. All groups were then 
bar-press (B-P) trained’ on 11.3%. The 4% 
group was significantly superior to the 11.3% 
and the 32% significantly inferior for the 10 
days of rewarded B-P and the four days of 
extinction. 
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The consequences of these results for the 
motivational and habit interpretations of 
reward were discussed. It was argued that 
the effects which have led to these interpreta- 
tions can be subsumed under the phenomena 
observed in determinations of psychological 
scales. 
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DISTRIBUTION OF PRACTICE 
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While the influence of massing of 
practice on performance of a motor 
skill can be readily demonstrated, 
studies concerned with the effects 
of this variable on the rate of growth 
and asymptote of habit strength have 
often seemed inconclusive. Theoret- 
ical predictions to the contrary, a 
number of studies (e.g., Archer, 
1954; Bourne & Archer, 1956; Dig- 
man, 1956; Reynolds & Bilodeau, 
1952) have failed to find clear-cut 
evidence for any permanent decre- 
ment such as Hull’s (1951) spr, or 
the inhibition variables postulated by 
Ammons (1947) and Kimble (1949). 
Denny,. Frisbey, and Weaver (1955) 
found “several lines of evidence” 
favoring the concept of conditioned 
inhibition in motor skill learning. 
However, the fact that the decre- 
mental function observed by them 
extinguished under distributed prac- 
tice suggests a concept somewhat 
different from the traditional slp 
function. A similar phenomenon was 
observed by Digman (1956), who 
favored an explanation in terms of 
transfer theory, although the extine- 
tion interpretation is admittedly more 
basic. 

Bourne and Archer (1956), measur- 
ing total time on target and time 
continuously on target (duration of 
hits), proposed that only one type of 
inhibition exists. This is the familiar 
temporary decrement, Jpg, which is 
presumed to dissipate rapidly with 
rest. (However, Janhke and Duncan 
[1956] have found evidence for traces 
of Tp after 24 hr. of rest.) Concern- 
ing Jr, Bourne and Archer (1956, 





p. 31) suggested that “. . . in addi- 
tion to reducing performance .. . it 
interferes with the acquisition of skill 
and thereby reduces the actual level 
of learning of the habit . . . [espe- 
cially that part associated with ] 
smooth pursuit tracking skill.” 

A study by Adams (1952), which 
was indirectly concerned with s/p, 
suggested the possibility of a con- 
vergence of the performances of two 
practice groups, one with practice well 
distributed, the other with practice 
considerably massed. In this study 
there were five practice sessions on 
as many consecutive days. Whether 
the performance of the massed prac- 
tice group would have risen to the 
level of the performance curve for the 
distributed practice group, had there 
been another day of practice under 
distributed practice conditions for 
both groups, is a question immediately 
suggested by an inspection of Adams’ 
curves, particularly in view of the 
conclusions reached by Bourne and 
Archer. It is this question which 
prompted the study reported here. 


METHOD 


Subjects. —The Ss were 41 men and women 
drawn from two classes in psychology con- 
ducted by the investigator. Approximately 
half of the Ss were volunteers from an intro- 
ductory course; the rest were students in a 
course in the psychology of learning, who 
participated in the experiment as an essential 
part of the course. In general, Ss were 
assigned to experimental conditions in order 
of their appearance except for an attempt to 
equalize sexes in the groups. One S was 
assigned, out of order, to the distributed 
practice group, Group D. The N in Group 
D was 22 (11 men and 11 women), and the 
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DISTRIBUTION OF PRACTICE 


N in Group M, the massed practice group, 
was 19 (10 men and 9 women). 

Apparatus.—A pursuit rotor manufactured 
by the Ralph Gerbrands Co. was used. The 
target button was 12.5 mm. in diameter, 
imbedded in a disc 25.5 cm. in diameter. 
The target button revolved at the rate of 30 
rpm in a clockwise direction and in an epi- 
cyclic fashion. A brass stylus, hinged 12.0 
cm. from the target end, with a tip 5.5 mm. 
in diameter, completed a circuit with the 
scoring element of a Standard Electric clock, 
whenever the tip was in contact with the 
target button during a work period. Target- 
to-floor distance was 35 in. An electronic 
interval timer manufactured by the Photo- 
switch Co., with interval accuracy of approx- 
imately 99%, was used to obtain the desired 
work-rest intervals. 

Procedure.—Six practice sessions were 
given to both groups over a two-week period 
on days Monday, Wednesday, and Friday. 
To eliminate bias associated with the longer 
rest from Friday to Monday, the first practice 
session was staggered, with S, beginning 
on Monday, S2: on Wednesday, etc. Each 
practice session consisted of eighteen 30-sec. 
trials. For Group D there was a rest interval 
of 1.5 min. between trials throughout the 
experiment. For Group M this interval was 
2 sec., which provided E the opportunity to 
read and reset the scoring clock. On Day 6 
both groups were treated alike, with 1.5-min. 
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Throughout the experiment an attempt 
was made to motivate Ss to maximum effort 
by stressing the alleged importance of the 
research, by asking each S to “do his best,” 
and by reporting S’s score to him at the 
conclusion of each trial. 

At the final practice session, Ss were told 
at the conclusion of Trial 108 that there was 
to be a test of ability consisting of four trials. 
The Ss were again told of the importance of 
the experiment and urged to do their best. 


RESULTS 


The over-all empirical results may 
be seen in Fig. 1. Something of the 
convergence effect obtained by Adams 
(1952) may be noticed, a convergence 
which seems more evident from a 
comparison of the terminal perform- 
ances of both groups on succeeding 
days, and less so from a comparison of 
initial performances. At the begin- 
ning of Day 6, the difference in per- 
formance between the two groups is 
approximately equal to the initial 
difference for Days 4 and 5, and not 
much less than that observed for 
Days 2 and 3. 








intervals between trials. Following Trial Performance for Group M on Day 
108 there were four “test trials.” 6, when practice conditions were 
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identical to those of Group D, does 
lend support to the original hypothesis 
that, had practice continued indefi- 
nitely, the performance curves for 
the two groups would have converged. 
However, the data suggest rather 
than demonstrate a convergence, and 
a one-sided test of the difference 
between groups for the sum of Trials 
109-112 (the test trials) implies that 
the groups are still operating at 
different levels. (The test used was 
the Mann-Whitney signed ranks test, 
and the obtained z was 2.56, which is 
significant at the .01 risk level.) 

An interesting resemblance may be 
noted between the performance curve 
for Group M on Day 6 and the curve 
for Group D on Day 4. This may be 
some indirect evidence of Archer's 
(1954) contention that under massed 
practice conditions Ss do not practice 
as much as they would under condi- 
tions of distributed practice. Some 
of the implications of this resemblance 
are more fully considered below. 


Parameters of growth in skill.—As 
an index of skill level (Sk) for Group 
D, the mean of the last two trials of 
each practice session was computed. 
This index was used for two reasons: 
the presence of warm-up decrement 
during the earlier stages of practice 
for each session, lasting for some 
unspecified amount of time, and the 
need for a more stable index of skill 
than a single trial mean. 

A number of different types of 
curves were examined for their appli- 
cability to the six points determined. 
Easily the best fit was accomplished 
by use of a modified Hull (1951) 
type exponential curve. The equa- 
tion for this curve, which appears 
in Fig. 2, is: 


Ska =15.2(1-10-275") 413.2 [1] 


where Ska represents the skill level 
for the distributed practice group 
attained at the end of a practice 
session, and m the number of practice 
sessions. 





DISTRIBUTION OF PRACTICE 


Because of the extensive amounts 
of work decrement usually present 
for Group M (except for Day 6), a 
different procedure was followed to 
determine skill level at the conclusion 
of each practice session under massed 
practice. It was first assumed that 
the amount of intersession forgetting 
was the same for both groups. 
(Indeed, there is evidence [Jahnke & 
Duncan, 1956] for little, if any, 
forgetting of rotary pursuit skill over 
a 2- or 3-day period.) A_ second 
assumption was that differences in 
performance at the beginning of any 
session reflected differences in skill 
level at that point. This difference, 
present at the beginning of a session, 
was then subtracted from the value 
for skill level for Group D at the 
conclusion of the previous practice 
session, and used as an index of skill 
level for Group M. Through the five 


points thus obtained, a modified Hull- 
type exponential curve was passed. 


The equation for the curve, which 
appears in Fig. 2, is: 


4- 
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(2] 

A measure of warm-up decrement, 
present at the beginning of each 
practice session, was computed by 
subtracting the obtained score for the 
first trial from the skill level attained 
at the end of the previous session. 
These values appear in Fig. 3, to- 
gether with an extrapolation to zero 
practice. (Due to the manner of 
computing skill level for Group M, 
warm-up values computed in the 
above fashion would be the same for 
either practice condition.) 

Shrinking the practice scale for 
Group M.—Since the performance of 
Group M on Day 6 bore such a 
resemblance to the performance of 
Group D on Day 4, it was tentatively 
concluded that at the beginning of 
Day 6, after 90 trials, Group M had 
received about as much practice as 
Group D had received at the begin- 
ning of Day 4, after 54 trials. There- 
fore, it was inferred that, under condi- 
tions of massed practice, at least for 


SRm = 17.3(1-10~ 8") +-8.1 
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Theoretical curves of skill growth (values for massed practice 


on shrunken practice scale). 


the first three practice sessions, Group 
M received .60 as much practice per 


trial as Group D. The practice 
scale was accordingly shrunken to .60 
of the scale for Group D, and skill 
values, as determined previously, 
plotted. The skill values for both 
groups, those for Group M on the 
shrunken scale, appear in Fig. 4. 
Here the curve and values for Group 
D are from Equation 1, while the 
curve for Group M has been drawn 
by inspection, using the asymptote 
and other values of Equation 2. 


DISCUSSION 


Two alternative interpretations of the 
data are available: the traditional two- 
factor theories of Hull (1951), Ammons 
(1947), and Kimble (1949) ; or the single- 
factor theory of Bourne and Archer 
(1956). The present study provides 
some evidence for the former in the 
failure of the performance curve for 
Group M to rise to the level of Group D 
on the last day of practice. The curve 


for Group M on the final day does, how- 
ever, strongly suggest a convergence, 
and it may be argued that there was 
insufficient practice on the final day to 
develop those aspects of skill associated 
with “‘smooth pursuit tracking skill.” 
The results of the study are perhaps 
better understood according to the 


‘views of Bourne and Archer concerning 


the effects of massing of practice. As 
the curves in Fig. 4 suggest, massing of 
practice apparently set an asymptote of 
skill somewhat lower than did distributed 
practice. The fact that the values for 
Equation 2, when plotted on a shrunken 
practice scale, coincide with the Group D 
skill curve during the early stages of 
practice, bears out Archer’s (1954) 
contention that under massed practice 
Ss learn less in part because they practice 
less. Beginning with Day 3, however, 
Group D apparently began to learn not 
only more per trial but also something 
which Group M failed to learn (at least 
until Day 6). This is very likely the 
“smooth pursuit tracking skill” for 
which Bourne and Archer (1956) have 
some evidence. The failure of the per- 
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formance curve for Group M to rise to 
the level of Group D may be attributed, 
as the curves in Fig. 4 suggest, to a 
considerable difference in practice in 
those parts of skill associated with high- 
level performance. Since the curves 
would imply that Ss in Group D began 
to learn these elements of skill perhaps 
as early as the beginning of Day 3, Group 
D may have had in this respect, by the 
end of the experiment, a fourfold ad- 
vantage in practice over Group M. 
While the computed skill values are 
unusually well described by the equations 
derived for them, they obviously fail to 
coincide at the point of zero practice. 
Extrapolation of the skill curve for 
Group M to the point of zero practice 
gives a value of 8.1. Actual performance 
for Group M on Trial 1 was 5.71. The 
difference between this and theoretical 
skill level, 2.4, is, by reference to the 
warm-up decrement curve, Fig. 3, a 
reasonable value. However, the skill 
curve for Group D, when extrapolated 
to zero, gives a value of 13.2. Actual 
performance was 4.83, which, even with 


the addition of 2.5 for warm-up decre- 
ment, gives a value considerably short of 
the skill level for zero practice derived 


from the equation. A _ possible inter- 
pretation of this state of affairs is that 
the equation is not applicable during the 
earliest part of practice, and that a 
different and more rapid growth of skill 
occurs under distributed practice at this 
stage. 

The advantage of distributed practice 
over massed practice is, then, possibly 
threefold: (a) More learning occurs per 
trial under distributed practice, because 
more practice received. (b) The 
growth of skill under distribt ted practice 
continues to a higher final level, very 
likely because of the acquisition of a type 
of response difficult to achieve under 
massed practice. (c) For some reason, 
early learning under distributed practice 
may be especially advantageous. 


1s 


However, the repeated failure to find 
conclusive evidence for any permanent 
decrement, even under the severely 
contrasting conditions of this study, 
suggests that in some situations massing 
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of .practice, far from causing any perma- 
nent damage to skill, may actually be a 
very economical type of practice. While 
Ss in the present study who learned 
under massed practice conditions may 
have received only 60% as much practice 
per trial as Ss under distributed practice 
conditions, they spent only 9 min. in the 
experimental situation; the distributed 
practice Ss spent more than 30 min., 
most of which time was devoted to 
reading magazines or gazing out a win- 
dow. An implication of the present 
study might be that some of this time 
could have been more profitably given 
over to practice, at least that portion of 
training after the beginning and before 
the acquisition of high-level skill. 


SUMMARY 


The growth of a motor skill (rotary 
pursuit) was investigated, using two groups 
of Ss under contrasting degrees of distribution 
of practice. Both groups received eighteen 
30-sec. trials per day for six days. For one 
group intertrial rests were 1.5 min. through- 
out the study. For the other group inter- 
trial rests were 2 sec. for the first five days 
of practice and 1.5 min. on Day 6. A test 
of four trials, separated by 1.5 min., was given 
after Trial 108, under conditions assumed 
to be fairly optimal in respect to motivation 
and practice conditions. 

Results indicate that the chief effect of 
massing of practice is to reduce the asymptote 
of skill, possibly by introducing conditions 
which make difficult the learning of the type 
of response necessary for high-level per- 
formance. It seems likely, however, that a 
switch in practice conditions, from massed 
to distributed practice, is accompanied by a 
corresponding shift in asymptote that 
ordinarily associated with distributed prac- 
tice. There was evidence that under massed 
practice Ss received considerably less practice 
per trial than did Ss under distributed prac- 
tice. Possible for a difference 
between “early” and “later” learning under 
distributed practice was tentatively offered 


to 


evidence 
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RESPONSE DECREMENT, INDUCED BY STIMULUS 
CHANGE, AS A FUNCTION OF AMOUNT 
OF TRAINING! 
DALBIR BINDRA anp JOHN F. SEELY 
McGill University 


‘Fink and Patton (1953) observed 
that the incidence of a_ learned 
drinking response decreased when the 
various visual, auditory, and tactual 
characteristics of the training situa- 
tion were altered. They also found 
the response decrement to be propor- 
tional to the number of stimulus 
characteristics changed from the train- 
ing to the test situation; the greater 
the stimulus change the greater the 
response decrement. The present in- 
vestigation is concerned with explor- 
ing how the amount of training given 
prior to introducing a stimulus change 
affects the influence of that change 
on an instrumental response. Such 
an investigation was considered in- 
teresting because it could provide 
one direct test of Bindra’s (1959; 
in press) formulation concerning the 
nature of stimulus-change induced 
response decrement. Furthermore, 
this formulation appeared to lead to 
predictions that are either opposed 
to those arising from the statements 
of Hull (1943, 1947) and of Guthrie 
(1935), Estes (1950) and Estes and 
Burke (1953), or do not necessarily 
follow from them. 

According to Bindra’s (1959; in 
press) formulation, stimulus change 
from the training to the test situation 
evokes some new, “‘irrelevant’’ re- 
sponses, and it is the interfer- 


( Relative response decrement = 


ence from these responses that leads 
to a decrement in the trained re- 
sponse. The new responses are con- 
sidered to be reactions to the novelty 
provided by the changed (i.e., test) 
situation, and in test situations in- 
volving only a small stimulus change, 
the new responses are thought to 
be mostly of an “exploratory” 
nature. The degree of novelty or 
strangeness of the test situation is 
necessarily dependent upon the extent 
to which the subject has already 
become familiar with the training 
situation. Thus, for any given stimu- 
lus change, the greater the training 
in the training situation, the greater 
would be the novelty offered by the 
test situation, and hence the greater 
would be the interfering exploratory 
responses evoked during the tests. 
Assuming that the exploratory re- 
sponses evoked by the training situa- 
tion reach a stable level after a few 
training trials, the ratio of these 
responses to the exploratory responses 
evoked by the stimulus change would 
decrease with increased training in 
the training situation. It follows 
from these propositions that the 
greater the training given before intro- 
ducing a stimulus change, the greater 
would be the relative response decre- 
ment brought about by that stimulus 
change. 


training performance — test performance 


training performance ) 


1 This study was supported by a grant (A.P. 27) from the National Research Council of 


Canada. 
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Furthermore, according to this view, 
the more highly trained Ss, as com- 
pared to those trained less well, 
would, with continued reinforcement, 
require longer to reach the level of 
performance existing before the stimu- 
lus change, assuming that the greater 
the novelty presented by a stimulus 
change the longer will the interfering 
exploratory responses persist. To test 
these predictions, a stimulus change 
was introduced into a simple runway 
after groups of rats had received 15, 40, 
or 80 reinforced (thirst—water) training 
trials, and the reinforcement was con- 
tinued during the tests until the per- 
formance of all groups reached an 
asymptote. 


METHOD 


Subjects.—The Ss were 36 male hooded 
rats about 100 days old. They were obtained 
from the colony maintained by the Royal 
Victoria Hospital in Montreal. A_ supply 
of Purina chow pellets was always available 
in their cages, but they were subjected to a 
23-hr. water deprivation schedule throughout 
the experiment. 

Apparatus.—A large piece of plywood, 
45 X 12 in., served as the floor of the runway. 
The runway itself was 36 in. long and 6 in. 
high, and was made of j-in. plywood. The 
long sides of the runway were attached by 
removable bolts to a 12 X 6-in. plywood 
piece at each end; this construction made it 
possible to vary the width of the runway from 
5 to 10 in. A starting box (84 X 3 X 6 in.) 
was placed at one end of the runway and was 
separated from it by a guillotine gate. At 
the other end of the runway, a glass drinking 
tube was inserted through a hole in the end 
piece, 2 in. from the floor and equidistant 
from the two sides. The whole runway was 
covered with a removable }-in. wire mesh top. 

During training, the width of the runway 
was set at 5 in., and the floor was covered 
with sawdust. The stimulus change, intro- 
duced after the appropriate number of train- 
ing trials, consisted of doubling the width of 
the runway to 10 in. and covering the floor 
with }-in. wire mesh instead of sawdust. 

Procedure.—On the first 8 days of the 
experiment, the (water-deprived) Ss were 
placed on a feeding stand to drink from water 
bottles for a period of about 40 min. All Ss 
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became accustomed to drinking in this man- 
ner by the end of this period. On the 
following day each S was given two trials 
on the runway ; each trial involved traversing 
the distance from the starting box to the 
drinking-tube end of the runway, and drinking 
water for about 10 sec. The time elapsing 
between the removal of the guillotine gate 
and the arrival of S at the other end of the 
runway was recorded with a stopwatch. 
Next day, Ss were given three trials each, 
and thereafter each S was given five daily 
trials. Immediately after the daily trials 
were over, each S was allowed 20 min. of 
free access to water on the feeding stand. 
Then it was returned to its cage. 

The Ss were split into three equal groups 
of 12 Ss each. Groups I and III were 
equated by dividing 24 Ss between them on 
the basis of their performance on the first 
15 trials; Group II was not equated with the 
others in this way, but its performance during 
training turned out to match remarkably 
well that of Group III. Group I was given 
a total of 15 training trials (none after 
grouping), Group II, 40, and Group III, 80 
before the stimulus change was introduced. 
Following the stimulus change, Ss in each 
group were continued on the same schedule 
of five daily reinforced trials until the mean 
daily running time reached an asymptote. 


RESULTS 


Initial effects of stimulus change.— 


The mean running time for five 
daily trials (the initial five training 
trials, which were spread over two 
days, were considered as the trials 
for the first day of training) was 
calculated for each S, and the mean 
of these figures for each group was 
determined. These group mean run- 
ning times are shown in Fig. 1. It 
is seen that there was no response 
decrement (i.e., increase in running 
time) following the stimulus change 
in the case of Group I; its mean 
running time on Day 4, the first test 
day, was Jess than its running time 
on Day 3, and its performance on 
Day 4 and on subsequent days (see 
the points on Day 8) was similar to 
the performance of the other groups. 
Groups II and III, on the other hand, 
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the different groups. 


showed a significant increase in the 
mean running time following the 
stimulus change; Group II's running 
time for Day 9 is significantly more 
than its running time on Day 8 
(P < 01), and the mean running 
time for Group III on Day 17 is 
similarly different from the corre- 
sponding figure for Day 16 (P < .01). 
(All P’s are based on ¢ tests; the use 
of the appropriate nonparametric tests 
would not alter the reported findings.) 

However, the stimulus change did 
lead to some response decrement even 
in the case of Group I, though this 
effect was overcome within two trials 
and does not show itself in the five- 
trial mean. This decrement is seen 
when the running times for individual 
trials just before and after the 
stimulus change are plotted, as in 
Fig. 2. Any possible “regression” 


Daily mean running times for the three groups of rats, which received 15, 40, or 80 
trials before the stimulus change was introduced. 


The arrows indicate the first test day for 


in the strength of response from the 
last trial on one day to the first trial 
on the following day cannot account 


for the obtained decrement. No evi- 
dence for such regression could be 
found in any of the groups. For 
example, in the case of Group III, 
there was a median decrease in running 
time of 1.5 sec. from Trial 5 on Day 3 
to Trial 1 on Day 4. Similarly, in the 
case of Group I, there was an improve- 
ment (median 1.3 sec.) in running 
time from the last trial of Day 2 
to Trial 1 of Day 3. 

Figure 2 shows that for Group I 
the first (test) trial after the stimulus 
change is as different in absolute 
terms from the running times im- 
mediately before the change as are 
the postchange running times of 
Groups II and III different from their 
prechange running times. When an 
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Fic. 2. Mean running time for Groups 
I, Il, and III on each of the 5 trials preceding 
and on each of the 10 trials following the 
stimulus change. The arrow indicates the 
first test trial for all groups. 


prechange running time 
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index of 
(mean of 
within the 


preceding the stimulus change) is 
compared with the running time for 


the first test trial, it is found to be 
significantly different (P < .01) from 
the latter for all the groups. This 
index of prechange running time had 
to be devised to serve as a stable 
measure of prechange running time 
in the case of Group I. The Ss in all 
groups showed considerable trial-to- 
trial variation and a rapid decline 
in running time on Day 3 because 
training was still at an early stage. 
Since Day 3 was the last day of 
training for Group I, an index such 
as the one employed had to be 
devised in order to obtain a measure 
of prechange running time that was 
stable and that also incorporated 
the improvement in performance on 
Day 3. That this index does not 
grossly underestimate the running 
times on Day 3, thus adventitiously 
yielding the obtained decrement on 
the first test trial in the case of Group 
I, is suggested by the fact that this 


index for Day 3 was higher than the 
running time on Trial 1 on Day 4 
for 7 of the 12 Ss in Group II and 
for 9 of the 12 Ssin Group III. This 
index was necessary only in the case 
of Group I, but it was used for all 
groups for the sake of consistency; 
by Trial 40 (Group II) and Trial 
80 (Group III) there appeared to be 
no marked difference between this 
index (mean of the three fastest 
trials) and the daily mean (based on 
five trials). 

'Even a cursory examination of 
Fig. 2 shows that there are no sig- 
nificant differences between groups 
in the absolute increase in running 
time following the stimulus change, 
the mean increases for Groups I, II, 
and III being 13.4, 15.7, and 14.8 
sec., respectively. However, the rela- 
tive increase in running time, that 
is, the ratio of the difference between 
the index of prechange running time 
and test running time to the index 
of prechange running time, increased 
with increased training, the means 
being 1.7 for Group I, 5.8 for Group 
II, and 8.0 for Group III. The differ- 
ence in this ratio between Group | 
and Group III is significant at the 
01 level; and the difference between 
Groups I and II is significant at 
the .05 ievel, but the difference be- 
tween Groups II and III is not 
significant. 

Return to the prestimulus change 
level of performance.—In this experi- 
ment, reinforcement was continued 
during the test trials in order to 
determine any possible group differ- 
ences in overcoming the effects of 
the stimulus change (see Fig. 2). 
The number of trials required by each 
S to reach, following the stimulus 
change, the level of its own index 
of the prechange running time was 
determined, and the group means 
calculated. These means were 4.3, 
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6.8, and 10.8 for Groups I, II, and 
III, respectively. The difference be- 
tween Groups I and III is significant 
at beyond the .01 level, and the 
other differences at beyond the .05 
level. These results show clearly 
that the greater the training given 
before the stimulus change was intro- 
duced the more trials it took Ss to 
overcome the response decrement. 


DISCUSSION 


The results of the present experiment 
demonstrate clearly that, with increase 
in training prior to the introduction of a 
stimulus change, there is an increase in 
relative response decrement as well as in 
the number of reinforced trials required 
to re-reach the prestimulus-change level 
of performance. Both these results were 
predicted from, and are consistent with, 
Bindra’s (1959; in press) formulation 
of the nature of stimulus-change in- 
duced response decrement, as described 
earlier. The present results are also 
relevant to the views of Hull (1943, 
1947) and of Guthrie (1935), Estes 
(1950) and Estes and Burke (1953), 
and it might be profitable to consider 
how far our results are consistent with 
these views. 

It is difficult to say what the Hullian 
prediction for this experiment would 
be because there is no unequivocal way 
of deciding how much of Hull’s system 
to take into account. At a fairly simple 
level, his statements on “stimulus gen- 
eralization”’ seem to apply to the present 
experiment. According to Hull, in the 
early stages of training, stimulus generali- 
zation is complete or nearly so, but, 
with increasing amounts of training, 
“a downward sloping gradient of pri- 
mary stimulus generalization of reaction 
strength gradually and spontaneously 
develops” (Hull, 1947, p. 133). Hull 
considers generalized response strength 
also to be a function of response strength 
generated during training (Hull, 1943, 
p. 199), so that any training after a 
gradient has been established would 
result in progressively greater generalized 
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response strength at each point on the 
gradient. Furthermore, if the stimulus 
change were made after a very large 
amount of training, though there should 
be an increase in the absolute level of 
response strength throughout the gradi- 
ent, there would be little reason to expect 
any further change in response decre- 
ment; however, for any given amount 
of response decrement from the training 
to the test situation, there should be a de- 
crease in relative response decrement, i.e., 


training performance-test performance 


training performance 


Applying these propositions to the 
present experiment, one should expect 
first an increase in relative response 
decrement, followed in this case where 
the absolute response decrements for 
the three groups are roughly the same, 
by a decrease in relative response 
decrement. However, the present re- 
sults show a continuous increase in rela- 
tive response decrement. As mentioned 
above, other predictions for the results 
of our investigation can also be derived 
by taking into account other aspects of 
Hull’s principles, and it is possible that 
some of those predictions will be con- 
sistent with the present results. Thus, 
at present, no clear disproof of Hull's 
views is possible. 

According to Guthrie (1935) and the 
formulations of Estes (1950) and Estes 
and Burke (1953), which are based on 
Guthrie’s general viewpoint, the prob- 
ability of occurrence of a trained response 
in the test situation is a function, 
roughly, of the number of stimulus 
elements in the training situation that 
are also present in the test situation. 
Inasmuch as the stimulus elements that 
become conditioned to the response 
with increased training are assumed to 
be randomly distributed within the 
total stimulus elements in the situation, 
the proportion of the conditioned stimu 
lus elements operating after the stimulus 
change would seem to be the same at all 
levels of training. 
that a given stimulus change 
would cause the same relative response 


Therefore, one should 
expect 
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decrement irrespective of the amount of 
training given prior to the stimulus 
change. This prediction is also con- 
trary to the findings of the present 
experiment. However, here again, by 
making certain assumptions about the 
relative weight of the new elements 
introduced by the stimulus change at 
different levels of training, a correct 
prediction can be derived from such a 
stimulus-elements model. It should be 
noted that, basically, Bindra’s formula- 
tion is consistent with the view that the 
occurrence of a response depends upon 
the proportion of stimulus elements 
that are common to the training and the 
test situations. However, it further 
states that a change in stimulus elements 
produces response decrement by virtue 
of evoking exploratory responses that 
interfere with the trained response, and 
that the extent to which the exploratory 
responses are evoked is determined, 
at least partly, by the degree of famili- 
arity developed with the training situa- 
tion. Whether ‘degree of familiarity” 
can be incorporated into the formulations 
of Estes (1950) and Estes and Burke 
(1953) remains to be seen. 


SUMMARY 


This study deals with the relation between 
the amount of instrumental training given 
prior to introducing a stimulus change and 
the influence of that change on the trained 
response. 

A stimulus change was introduced into a 
simple runway after groups of rats had 
received 15, 40, or 80 reinforced (thirst— 
water) training trials, and the reinforcement 
was continued during the tests until all 
groups reached an asymptote. The results 
showed the relative response decrement 
brought about by the stimulus change to 
increase as a function of the number of train- 
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ing trials. It was also found that the greater 
the training given prior to introducing the 
stimulus change the more (reinforced) trials 
it took the animals to overcome the response 
decrement (i.e., to return to the prechange 
level of performance). 

It was suggested that these results require 
further clarification of the formulations of 
Hull, Guthrie, and Estes, but are quite con- 
sistent with Bindra’s view concerning the 
nature of stimulus-change induced response 
decrement. According to the latter view, 
the amount of response decrement depends 
upon the degree to which the novelty pro- 
vided by the stimulus change evokes inter- 
fering “exploratory” responses; and the 
novelty of a given stimulus change is thought 
to increase with increased training under 
the conditions existing before the stimulus 
change is introduced. 
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Though it seems to be a generally 
accepted fact that the effect of an 
increase in motivation is to increase 
stimulus generalization, there are 
differing conceptions of the nature 
of this function. Within the Hullian 
framework (Hull, 1943), since gen- 
eralization is a habit function, it 
would be expected that the effect 
of drive would be multiplicative—i.e., 
the generalization gradients of Ss with 
similar training but with different 
drive levels during testing would be 
multiples of each other. Miller (1944), 
on the other hand, has suggested 
that such gradients would be parallel, 
i.e., additively related. The present 
study was designed to investigate 
generalization as a function of drive 
level, using different levels of food 
deprivation in pigeons, the key- 
pecking response, and the stimulus 
dimension of the visual spectrum. 


EXPERIMENT I 
Method 


Subjects. —Thirty-seven white Carneau pi- 
geons were used. 


1An earlier version of this paper was 
presented at the May, 1958, meetings of the 
Midwestern Psychological Association in 
Detroit. The research was carried out while 
the first author was a research fellow of the 
United States Public Health Service. 

2 The authors wish to express their appre- 
ciation to Norman Guttman for his invaluable 
contributions to all stages of this research: 
The study was supported by Grant M-1002 
from the National Institute of Mental Health, 
U. S. Public Health Service and by the Re- 
search Council of Duke University. 

3 Now at Kent State University. 

4 Now at the University of North Carolina. 


Apparatus.—An automatic key-pecking 
apparatus fitted with a Bausch and Lomb 
monochromator was used. This apparatus 
is described in detail by Hanson (1957). 

Procedure—The Ss were _ individually 
caged and maintained ad libitum until their 
weight stabilized (approximately 7 days). 
They were then starved to 80% weight, 
magazine trained, and trained to peck at a 
key illuminated by a monochromatic light 
of 550 my. After 50 continuous reinforce- 
ments, 10 consecutive daily sessions of VI 
(variable interval) training were administered. 
The mean inter-reinforcement interval was 
approximately 1 min. Stimulus-on periods 
of 60 sec. were alternated with stimulus-off 
(“blackout”) periods of 10 sec. Thirty 
stimulus-on periods constituted a daily 
training session. 

The Ss were divided into four groups 
matched on the basis of number of responses 
on Day 10 of VI training. The mean num- 
ber of responses was 1279 for the group to be 
tested at 90% of ad libitum weight (Group 
90, N = 10). For Group 80 (N = 9), the 
mean number of responses was 1339; for 
Group 70 (N =9), M = 1491; and for 
Group 60 (N = 9), M = 1373. These dif- 
ferences are not significant (F < 1; 3, 33df). 

For 7 days after the completion of VI 
training, Ss were maintained at 80% weight, 
or were starved or fed to the level appropriate 
for their group. On Day 8 after the comple- 
tion of VI, all Ss were tested for stimulus 
generalization in extinction. Eleven different 
test stimuli were used (490 my to 610 mz 
by 10 my intervals, with the omission of 500 
my and 600 my). With the single exception 
that each test began with the CS, the 11 test 
stimuli were randomized within a series, and 
12 different random series were presented 
toeach S. This resulted in a schedule of 132 
stimulus presentations. Each stimulus pres- 
entation was for 30 sec. and was followed by a 
10-sec. blackout period, during which the 
number of key-pecks in the preceding period 
was recorded and the stimulus was changed. 

After an interval of one day to permit 
spontaneous recovery, a second generalization 
test was administered. 
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Fic. 1. Response strength as a function 


of body weight. 


Results and Discussion 


Although the relationship between 
response strength and hunger level 
is somewhat peripheral to the major 
purpose of this experiment, it may be 
of interest to examine this function 
before proceeding. Response rate at 
the start of the first generalization test 
(as determined from the first three 
presentations of all test stimuli) 
varies as an increasing monotonic 
function of deprivation level (Fig. 
1). The differences among groups 
are significant (/ = 3.89; 3, 33 df; 
01 < P < .025). 

Not only is the momentary rate 
a function of drive but also the total 
number of responses emitted during 
the 14 hr. test varies in a similar 
fashion (see Fig. 1). These differ- 
ences are also significant (F = 3.46; 
3, 33 df; 025 < P < .05). 

The mean generalization gradients 
obtained from the four drive groups 
are presented in Fig. 2. Though 
similar in general form, the gradients 
differ both in over-all height (level 
of responding) and in slope. The 
curves indicate that generalization, 
in terms of the total number of re- 
sponses given to non-CS values of the 
stimulus, increases monotonically with 
increases in drive level. 
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If the obtained curves are extra- 
polated in the direction of both 
higher and lower wave lengths than 
those sampled, it appears that the 
extent of generalization also positively 
varies with drive. In fact it appears 
likely that Ss at 60% weight would 
have responded with considerable 
strength to stimuli covering the 
entire range of the visual spectrum. 

To make the slopes of the obtained 
generalization gradients of the four 
groups directly comparable (i.e., to 
adjust for differences in level of 
responding), the gradient of each S 
on generalization Tests 1 and 2 was 
transformed into percentages of total 
response output to form what may 
be called gradients of relative generali- 
zation. In Fig. 3 are presented the 
four mean gradients in percentage 
form. Because these gradients are 
not symmetrical, slope differences for 
left and right sides were analyzed sepa- 
rately. The Alexander trend tests 
are presented in Table 1. The prob- 
ability levels for the individual F 
tests for differences between group 
slopes are presented in Table 2. 
These tests support the finding that 
the gradient of the 80% group is the 
steepest, that of .the 60% group 
flattest, while the’ 70% and 90% 
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Fic. 2. Stimulus generalization gradients of 
the four experiment groups. (Exp. I.) 
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TABLE 1 
ALEXANDER TREND TESTS OF RELATIVE GENERALIZATION GRADIENTS (Exe. 1) 
<550 my > 550 my 
Source df $$ —_——- — 
MS F MS F 
Over-all slope 1 7585.10 | 265.12* 5116.95 | 68.71* 
Over-all deviation from linearity 4 404.05 20.93* 149.53 9.25* 
Between group slopes 3 203.26 7.10* 265.75 SS eng 
Group deviation from linearity 12 39.39 2.02** 34.34 BS 
lok inten ks ok 
*P < 025. 
+ P< .05. 
group curves are intermediate in a high responding sample, while the 
slope. three lowest responders in these 


The generalization gradients ob- 
tained during spontaneous recovery 
were, on the whole, in agreement with 
those obtained on the first day of 
testing. The 60% group appeared 
to have the flattest gradient, while 
the other three gradients were quite 
similar to each other. Only the 
differences between the 60% curve 
and the 80% curve and the 60% and 
70% curves (for CS and above) 
were significant (P < .01). None of 
the other slope differences between 
groups reached significance. 

The finding of a relationship be- 
tween response strength and drive 
level suggests the possibility of a 
relationship between response strength 
and generalization slope. To check 
this possibility, the three highest 
responding Ss (on Test 1) in each 
of the five groups were pooled to form 


TABLE 2 


SIGNIFICANCE LEVELS OF INDIVIDUAL Group 
Store Comparisons (Exp. 1) 














<550 my > 550 mp 
80% 70% 60% 80% | 20% | 60% 
90% | <.05| N.S. <.025| NS|NS| NS. 
A | <.025| <.001,  |NS.| <.005 
70% | | <.05 NS. 


Note.—N.S. = P > .0S. 


groups were similarly combined to form 
a low responding sample. Though 
the high group responded five times 
as frequently as the low one, the 
relative generalization gradients of 
the two groups were quite similar 
(Fig. 4). There is little evidence 
in the present experimental situation 
for the hypothesis that relative gen- 
eralization slope is significantly re- 
lated to response strength. The ef- 
fects on generalization slope here 
attributed to manipulations in drive 
may therefore not be ascribable 
simply to changes in response strength. 
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5 Includes a group of Ss run in Exp. II. 
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of high responding and low responding 
groups of Ss. (Exp. I.) 


In a recent experiment similar in 
conception to the present one by Jenkins, 
Pascal, and Walker (1958), it was also 
found that generalization varies posi- 
tively with motivation level. However, 
these investigators attributed the ob- 
served effect to the intervening variable 
of response strength. They present 
evidence that in the experimental situa- 
tion which they employed, generaliza- 
tion slope varies reliably with response 
strength. They conclude that generali- 
zation slope varies with response strength, 
and with drive only indirectly. 

Jenkins et al., employed a somewhat 
different training and testing procedure 
from the present one, and what is 
probably more important, they used a 
different stimulus dimension, circle size. 
It has been shown here that with the 
spectral dimension the effect of response 
strength on relative generalization is 
negligible. It is reasonable to suggest 
that even in the Jenkins et al. study, dif- 
ferences in response strength may not 
account for all of the variance attributa- 
ble to the drive manipulation. Only 
research with different stimulus dimen- 
sions but with identical experimental 
procedures will indicate the role of the 
stimulus dimension in the relation be- 
tween generalization slope and response 
strength. 

EXPERIMENT II 


Hull's multiplicative hypothesis would 
require the gradients of relative generali- 
zation (as here defined) to be the same 
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for all drive groups. Méiller’s additive 
hypothesis would predict increasingly 
broader relative generalization gradients 
with increases in drive level. The 
results of Exp. I are clearly not in accord 
with either of these predictions. While 
the data are more clearly contradictory 
to the multiplicative hypothesis, the 
relatively flat gradient of the 90% group 
is inconsistent with both of the above 
conceptions. 

Another explanation of the present 
findings consistent with Hullian theory 
involves the notion of the drive stimulus 
and its generalization. ‘In the present 
study the steepest gradient was obtained 
with a group of Ss whose drive level was 
the same in testing as in training. The 
flattest gradient was obtained with the 
group which underwent the greatest drive 
change between training and testing. 
It is thus possible to hypothesize a 
flattening of the wave-length gradient 
as a function of drive stimulus changes. 
Such a notion would imply that when 
interoceptive stimulation is altered, ex- 
teroceptive stimulus changes become 
less important. 

To test this possibility Exp. II was 
performed. This involved the training 
and testing of a group of Ss at 70% of 
ad libitum body weight. The generali- 
zation gradient of this group may be 
compared with those of two groups in 
Exp. I, the group trained at 80% and 
tested at 70% (80 — 70), and the group 
trained at 80% and tested at 80% 
(80 —> 80). If the 70— 70 group pro- 
duces a gradient similar to that of the 
80 — 80 group, the differences between 
groups in Exp. I may be attributed to a 
change in the drive stimulus from train- 
ing to testing sessions. If the generaliza- 
tion gradient of the 70— 70 group 
resembles that of the 80 — 70 group, the 
proposed role of drive stimuli in this 
situation would be open to serious ques- 
tion. If the gradient of the 70 — 70 
group is intermediate between those of 
the 80 — 80 and the 80 — 70 groups, it 
would suggest that both drive stimuli 
and other characteristics of drive level 
have a role in determining relative 
generalization slope. 
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TABLE 3 
ALEXANDER TREND TESTS FOR RELATIVE GENERALIZATION GRADIENTS OF 
Groups 80, 80 anp 70, 70 (Exp. II) 
| < 550 my > 550 mu 
Source df -— — a 
MS F MS F 
Over-all slope 1 4921.23 130.02* 3559.61 59.53* 
Over-all deviation from linearity 4 234.82 9.70" 138.03 9.10* 
Between group slopes -_ 243.83 6.44** 312.67 5.23°*° 
Group deviation from linearity | + 48.99 | 2.02 | 76.09 5.02* 
AD gles Gebc te Te ee i ; 
+P < 025. 


Pp < 0S. 


Method 


Subjects.—Nine white Carneau pigeons 
were used. 

Apparatus.—The apparatus was the same 
as in Exp. I. 

Procedure.—The procedure was identical 
to that used in Exp. I with the exception that 
all nine Ss were trained at 70% weight, 
maintained for seven days after the comple- 
tion of VI training, and tested for generaliza- 
tion at this weight level. 


Results and Discussion 


The generalization gradient for the 
70 — 70 group is presented in Fig. 5 
with the 80—+ 70 and 80 — 80 gra- 
dients from Exp. I. The similarity 
between the 80-—+70 and 70—70 
gradients in Fig. 5 is striking. There 
is no significant difference between 
the slopes of these two curves. 
Table 3 presents the results of the 
analysis of the slopes of the 80 — 80 
and 70— 70 gradients. It is clear 
that these two curves differ signifi- 
cantly. 

The results of Exp. I] suggest that 
the slope of the generalization gradient 
varies with the drive level of S at the 
time of the test, and is independent 
of the drive level during training. 
Drive stimulus changes appear to 
have little or no effect on stimulus- 
generalization slope. 

In conclusion, it was found that the 
amount of stimulus generalization in 

‘ 


. 


terms of the number of responses given 
to non-CS test stimuli varies as a mono- 
tonic increasing function of level of 
motivation. In addition, the extent of 
generalization, i.e., the range of stimuli 
which will evoke an appreciable response, 
seems to vary positively with drive. 

Looked at in another way, however, 
the effect of drive is nonmonotonic. As 
hunger increases, the proportion of total 
responses given to non-CS values first 
decreases, then increases, with an ap- 
parent minimum at 80% of ad libitum 
weight. 

The results of the present experiment 
appear to call into question any position 
which views response strength as a 
single entity to be considered independ- 
ent of its various components. Perhaps 
habit and drive do not merely multiply 
with each other, but rather interact in 
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some way. Indeed, it may be that 
different habits are learned and/or per- 
formed at different drive levels (Bruner, 
Matter, & Papanek, 1955; Cotton, 1953). 
In any case, it appears reasonably 
certain that as far as stimulus generaliza- 
tion is concerned, an S with low drive 
and high habit strength performs in a 
manner predictably different from an S 
with a higher drive but a weaker habit.* 


SUMMARY 


The present investigation employed the 
key-pecking response in the pigeon to study 
stimulus generalization as a function of level 
of hunger. Four groups of pigeons were 
trained at 80% of ad lib. body weight to peck 
at a monochromatic stimulus, and then were 
tested at 90%, 80%, 70%, and 60% weight 
levels. It was found that in terms of abso- 
lute number of responses to stimuli other than 
the CS, or in terms of the range of stimuli to 
which S will respond, generalization increases 
as a monotonic function of deprivation. 
However, in terms of number of responses to 
non-CS stimuli relative to total responses, 
generalization first decreases, then increases, 
with a minimum at 80%. 

Over-all response strength (both in terms 
of initial rate during generalization testing 
and total responses during the 14 hr. test) 
appeared to be a negatively accelerated 
increasing function of weight loss. However, 
a comparison of the generalization gradients 
of high responding and low responding Ss 
indicated that the effects of deprivation on 


®A theoretical treatment of this problem 
which is in some respects consistent with the 
present findings has recently been offered by 
Estes (1958). 
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generalization cannot be accounted for on 
the basis of changes in response strength 
alone. 

In a second experiment to check on the 
role of the drive stimulus, it was shown that 
the slope of the generalization gradient varies 
with the drive level at the time of the test, 
and is independent of the drive level during 
training. 

These results appear to call into question 
any position which views response strength 
as a single entity, to be considered inde- 
pendent of its associational and motivational 
components. 
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THE RELATION BETWEEN PHYSIOLOGICAL MEASURES 
OF EMOTIONALITY AND PERFORMANCE IN 
EYELID CONDITIONING! 
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The basic theoretical framework 
within which recent studies of classical 
defense conditioning have been con- 
ducted assumes that drive level (D) 
and associative strength (7) combine 
multiplicatively to determine excita- 
tory potential (£), which in turn 
determines the level of performance. 
It has further been assumed that 
drive level (D) in these studies is a 
function of a persisting emotional 
response (r.) which is aroused by 
noxious stimulation and which, ac- 
cording to some neurophysiological 
evidence (Lindsley, 1951), may have 
a facilitative effect on cortical mech- 
anisms. Since this mechanism (r,) 
is assumed to have the properties of a 
response, its magnitude should be a 
function of those experimental vari- 
ables which determine the strength 
of overt responses to noxious stimu- 
lation. This reasoning has led to 
the hypothesis that drive level (D) 
is a function of (a) the intensity of the 
noxious UCS used in conditioning, 
and (b) the score on a questionnaire 
designed to measure individual dif- 
ferences in emotionality, the A-Scale 
(Taylor, 1953). Confirmations of pre- 
dictions based on this hypothesis and 
the other parts of the theory of 
conditioning given above have pro- 


! This study, conducted under the general 
direction of K. W. Spence, is part of a project 
concerned with the influence of motivation 
on performance in learning under contract 
N9 onr-93802, Project NR 154-107 between 
the State University of lowa and the Office 
of Naval Research. 

2 Now at Hobart College, Geneva, New 
York. 


vided a good deal of support for the 
theory. This evidence has been ex- 
tensively summarized by Spence (1956, 
1958) and Spence and Ross (1957). 

In addition to the use of the A- 
Scale, another method of selecting Ss 
differing in emotionality is suggested 
by the theory. There are a number of 
so-called physiological responses such 
as the GSR, pulse rate, respiration 
rate, skin temperature, etc., which 
frequently have been used to define 
emotionality. The hypothesis may 
thus be formulated that the magni- 
tude of these responses is some 
increasing function of the intensity 
of the hypothetical emotional response 
(r.). If drive level is determined by 
the intensity of r., Ss who give large 
physiological responses to noxious 
stimulation should have a higher level 
of drive (D) than Ss who give small 
responses to this stimulation, and 
hence should exhibit a higher level 
of performance in eyelid conditioning. 

In a preliminary study (Runquist 
& Ross, 1958) pulse rate changes 
and GSR responses to a weak air 
puff were measured, and Ss were 
classified as emotional or nonemo- 
tional on the basis of these measures. 
While the emotional Ss showed a 
significantly higher level of _ per- 
formance in eyelid conditioning than 
the nonemotional Ss, certain diffi- 
culties involved in the measurement 
of the physiological responses made 
a repetition advisable under condi- 
tions which would give more reliable 
measures. 

This study was designed to deter- 
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mine individual differences in emo- 
tional responsiveness to weak noxious 
stimulation, and to ascertain the 
relation of this level of responsiveness 
to level of performance in eyelid 
conditioning. 


METHOD 


The experiment consisted of two parts, 
both conducted during the same session. 
The first part was the testing period, the 
second part the conditioning period. 

Test period.—Measures of each S's pulse 
rate change and palmar skin conductance were 
taken in response to air puffs of medium 
intensity (.6 psi) delivered to the right eye 
at roughly 1-min. intervals. No warning 
signal was given. No measurements were 
made for the first five trials, since most Ss 
gave a large skeletal reflex which interfered 
with accurate recording of the physiological 
responses. This could have been avoided by 
using a warning signal; however, most Ss 
showed a response to any kind of warning 
signal which differentially confounded the 
magnitude of response to the air puff. With 
the equipment available simultaneous record- 
ing of both pulse rate and skin conductance 
was not possible. Therefore, pulse rate was 
recorded on the second five air puff trials 
and skin conductance on the following five. 
The measures obtained are thus confounded 
with differential adaptation; however, the 
effects of such adaptation would be to reduce 
individual differences and lessen the likelihood 
of obtaining the predicted results. Pulse 
rate and skin conductance were measured 
for a period of 15 sec., starting 5 sec. before 
the air puff. 

Apparatus.—During the test period S 
was seated in the eyelid conditioning room. 
In addition to the regular equipment for 
presenting the UCS and recording eyeblinks 
(Spence & Taylor, 1951), one standard EKG 
electrode was attached to each forearm, and 
a pair of GSR electrodes consisting of lucite 
cups j in. in diameter with a zinc plate at the 
bottom were placed on the back and palm of 
the right hand. The cups were filled with a 
cellulose sponge saturated with 10% zinc 
sulfate electrode paste. 

Skin conductance. was measured by the 
exosomatic method. The amplifier was a 
modified version of a simple bridge circuit 
described by Haggard and Gerbrands (1947). 
A measure of base level conductance was 
available on each trial. The EKG leads led 
to a standard EKG amplifier, the output of 
which led to a Brush ink writing oscillograph. 


W. N. RUNQUIST AND L. E. ROSS 


Each pulse beat resulted in a vertical mark 
on a continuously moving paper. 

Conditioning.—\Immediately following the 
test period, eyelid conditioning was begun. 
Each S received 80 conditioning trials pre- 
ceded by three trials with the CS alone. The 
CS was the increase in brightness of a 6-cm. 
circular disc. The UCS for conditioning was 
a 1-lb. air puff delivered to S's right eye. 
The CS was 550 msec. in duration with the 
UCS occurring 500 msec. after its onset. 
The duration of the air puff was limited to 
50 msec. by a 60-cycle 110-v. AC-operated 
solenoid valve controlled by an electronic 
timer. The intertrial interval was either 15, 
20, or 25 sec. in a prearranged order. A 
verbal ready signal preceded each trial by 
from 2 to 4 sec. The Ss were instructed to 
blink and fixate the CS upon presentation of 
the ready signal. A CR was recorded when 
the record showed a deflection of 1 mm. in 
the interval 200-500 msec. following the 
onset of the CS. 

Subjects—The Ss were 90 volunteers from 
introductory psychology courses. An addi- 
tional 32 Ss were run but were discarded. 
Apparatus difficulties were responsible for 
20, eight were classified as voluntary re- 
sponders (Spence & Taylor, 1951), and four 
made two initial CR’s to the light. All Ss 
had previously taken the A-Scale. 

Measurement of emotionality—For each 
test trial, base level pulse rate was taken as 
the speed of the last four beats preceding the 
air puff, and skin conductance was measured 
immediately preceding the air puff. The 
response recorded was the fastest four beats 
and the highest conductance in the 10-sec. 
period following the air puff. 

For the purposes of the main analysis, 
one response measure in each system was 
chosen: median pulse rate change, and change 
in base level conductance over the five test 
trials. The first measure was suggested by 
the previous study (Runquist & Ross, 1958) 
and the second by Schlosberg (1954). 


RESULTS 


Heart rate responses ranged from 
44 beats/min. to minus 9 beats/min. 
with a mean change of 12.0 beats/ 
min., while base level changes ranged 
from 30,000 ohms to minus 12,000 
ohms with a mean change of 5800 
ohms. 

Two different methods of selecting 
emotional and nonemotional Ss were 
employed. One involved converting 
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the median pulse rate change and 
change in base level conductance to 
z scores and then summing these to 
obtain an emotionality index. Using 
this method, the 20 Ss with the highest 
scores and 20 Ss with the lowest 
scores were compared on performance 
in conditioning. Curves of the ac- 
quisition of the CR in terms of per- 
centage CR’s in each block of ten 
trials are shown as the two solid 
curves of Fig. 1. The emotional Ss 
it will be seen, exhibit a consistently 
higher performance level over the 
course of conditioning. A Mann- 
Whitney test on the total number of 
CR’s over the last 40 trials gave a z 
value of 1.56 which has a one-tail 
probability of .06. These results 
were obtained with an equal weighting 
of the two measures. Undoubtedly 
better prediction could be obtained 
by multiple regression analysis. 

The second method of selecting Ss 
was suggested by some recent work on 
autonomic responses. Lacey (1956) 
has advanced the hypothesis that 
emotionality is response specific for 
each individual. That is, some peo- 
ple respond to noxious stimulation 
with an increase in pulse rate, others 
with an increase in skin conductance, 
others with respiratory changes, etc. 
This would lead to low correlations 
between the physiological measures 
and might lead to a low correlation 
between any single measure and per- 
formance in conditioning. The corre- 
lation between the two physiological 
measures was low (r = .17) as were 
the correlations between these two 
measures and conditioning (the r for 
pulse rate change and conditioning 
was .12, and for base level conduct- 
ance change and conditioning was 
.29). 

With this hypothesis in mind, two 
groups were selected as follows: An S 
was classified as emotional if he was 
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Fic. 1. Percentage of CR's in blocks of 
10 trials for groups of Ss selected by combina- 
tions of physiological measures. 


in the top 10% of the distribution on 
either pulse rate change (those show- 
ing the largest increase) or skin 
conductance base level change (those 
showing the largest increase). Eight- 
een Ss fit into this category. The 
nonemotional group was comprised 
of the same 20 Ss as the nonemotional 
group in the previous comparison. 
The curve of the acquisition of the 
CR is shown as the dotted line in Fig. 
1. The emotional group is clearly 
superior to the nonemotional group 
in level of performance. A Mann- 
Whitney test of the total CR’s in the 
last 40 trials gave a value of z = 2.09 
which is significant at the .02 level. 
It appears, moreover, that selection 
of the extreme groups by this method 
results in greater differences in per- 
formance than the method of sum- 
ming measures. The emotional Ss 
selected on the single measure cri- 
terion exhibited superior performance 
to that of the emotional Ss selected 
by the summed scores, although there 
is no way to evaluate this statistically 
since some of the same Ss appear in 
both emotional groups. 

Since scores on the A-Scale were 
available for each S it was possible 
to make a determination of the rela- 
tion between A-Scale score and physi- 
ologically defined emotionality. A 





332 


Pearson r was computed between each 
S’s A-Scale score and the larger of 
his two physiological measure z scores. 
The resulting 7 of .22 was significant 
at the .05 level. 


DISCUSSION 


These results which show that emo- 
tional Ss defined by a_ physiological 
index of emotionality exhibit a higher 
level of conditioning than nonemotional 
Ss are in agreement with the hypothesis 
which relates drive level (D) to the 
intensity of the emotional response (r,). 
The results further support, although 
not as clearly, the hypothesis that there 
are individual differences not only in the 
level of emotionality, but in the par- 
ticular response system which is acti- 
vated. 

In the preliminary study, it was found 
that the GSR response to the air puff 
was a fairly good predictor of condi- 
tioning performance, while in the present 
study it was not. The apparent con- 
tradiction is not surprising if one con- 
siders that each S received ten presenta- 
tions of the air puff in this study before 
any GSR measures were taken. The 
effect of these trials would be to reduce 
any individual differences in the magni- 
tude of the GSR response which were 
initially present. 

The possibility should be considered 
that the changes in the physiological 
measures reflect skeletal startle responses 
to the air puff rather than emotional 
responsiveness. If such were the case 
a general drive theory would still 
predict the obtained conditioning, physi- 
ological index relationship since the 
magnitude of the startle reflex would 
also be assumed to reflect drive level (D). 


SUMMARY 


Ninety Ss were given 15 air puff alone 
trials, on the last 10 of which pulse rate 
changes and skin conductance changes were 
measured. All Ss were then given 80 condi- 
tioning trials. ‘Two groups of Ss were selected 
from the total sample and their conditioning 
performance compared. One group, the 
nonemotional group, were the 20 Ss with the 
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lowest total combined z score on the two 
physiological measures. Emotional groups 
were defined in two ways: One emotional 
group consisted of the 20 Ss with the highest 
total z scores. Another emotional group 
was formed by taking the 10% Ss who showed 
the greatest pulse rate increase, and the 10% 
who showed the greatest skin conductance 
increase. The difference in conditioning 
performance between the nonemotional group 
and the emotional group selected by combin- 
ing z scores approached significance, while 
the emotional group selected on single 
measures showed significantly higher per- 
formance than the nonemotional group. The 
results were interpreted as supporting a 
hypothesis which relates drive level (D) 
to emotional responsiveness. 
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SET TO RESPOND AND THE EFFECT OF INTERRUPTING 
SIGNALS UPON TRACKING PERFORMANCE ! 


STEPHEN GRIEW 
Unit for Research on Employment of Older Workers, University of Bristol 


It has now been demonstrated 
beyond much doubt that a delay 
will occur in making a response to a 
signal which occurs very shortly 
after another to which a response has 
to be made (Craik, 1948; Davis, 1957; 
Hick, 1948; Telford, 1931; Vince, 
1949). Much of the work from which 
this conclusion has been drawn has 
been reviewed by Welford (1952) 
who also discusses the concept of 
“psychological refractoriness’ which 
has arisen from it. This concept has 
been employed in the interpretation 
of a considerable range of high-speed 
performance (e.g., Hick, 1948; Hick 
& Bates, 1950; Welford, 1958) in- 
cluding tracking. 

On the basis of the refractoriness 
hypothesis, one would predict that in 
a situation in which S is required 
to track continuously and at the 
same time to respond to irregular 
interrupting signals there will be 
some interference between the two 
sets of incoming signals, such that 
performance at both tasks will de- 
teriorate. The first experiment to 
be reported below was designed to 
test this prediction. The aim of the 
second experiment was to clarify 
a suggestion arising from the first 
that the interference of signals from 
two sources is not the only determinant 
of performance decrement in tasks 
of this type. 


!This research was part of a program 
financed by the Nuffield Foundation, London. 
The author is grateful to the Directors and 
Management of the Mond Nickel Company, 
Limited, for prov iding Ss, and to the Ss for 
their cooperation. 


EXPERIMENT | 


In an attempt to test the prediction 
outlined above, performance at a 
pursuit tracking task alone was com- 
pared with performance at a pursuit 
tracking task undertaken at the same 
time as responding to interrupting 
auditory signals. Also, efficiency at 
responding to interrupting auditory 
signals while maintaining continuous 
tracking was compared with efficiency 
at responding to auditory signals 
when tracking was not in progress. 
Interrupting signals were auditory 
in order to ensure that any inter- 
ference which occurred would be 
centrally based. 


Method 


A pparatus.—The apparatus was a slightly 
modified version of that used by the author 
(Griew, 1958) in a study of age changes in 
pursuit tracking involving interrupted pre- 
view. A complex harmonic course, drawn 
on a band of paper, moved downward past 
a window placed perpendicularly to S's line 
of sight. At the point at which it disappeared 
from view, a pen was placed. This pen could 
be moved across the paper by means of a 
handwheel. Preview of the course to be 
followed was given above the point at which 
matching took place. The S tracked the 
course by operating the handwheel with his 
left hand, and at the same time kept his right 
forefinger above a key placed conveniently 
on the right of the apparatus. This key 
was pressed immediately upon the occurrence 
of a signal from a buzzer operated by E. 
The signal from the buzzer was extinguished 
when the key was pressed. The tracking 
course took the form of 94 “‘swings"’ and did 
not repeat. It was the same at every track- 
ing condition for all Ss. Interrupting audi- 
tory signals occurred at random time intervals 
apart, the intervals being the same for each 
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S, and ranging from 3 to 12 sec., with a mean 
of about 5 sec. 

Subjects.—There were 20 Ss. They were 
industrial workers, randomly drawn from a 
population matched for occupation and 
“background.” They were all right-handed. 
Their ages ranged from 21 to 50 yr. There 
were no age differences in efficiency of re- 
sponding to auditory signals, and such age 
differences as were observable in performing 
the tracking task did not affect the issue 
being considered here; hence, the results of 
Ss of all ages are taken together. 

Design.—F ive experimental conditions were 
studied. These were: 


Condition A: Responding to auditory 
signals alone. Here, S responded to a series 
of nine auditory signals given irregularly 
over the course of 45 sec. 

Condition T,: Performance at the pursuit 
tracking task alone at a course speed of 4.50 
cm./sec. This gave a course duration of 64 
(+1.20) sec. 

Condition T,: Performance at the pursuit 
tracking task alone at a course speed of 9.00 
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TABLE 1 
TRACKING Scores: Exp. I 





Mean Timing 
Error (Sec.) 


O1 .23 


02 (P = ns) ‘27 (P = .05) 
—.05 41 


~ os) (P = ns) | “4) (P = .002) 


Mean Amplitude 
Error (Cm.) 











cm./sec. This gave a course duration of 32 
(+0.60) sec. 

Condition T,A: Pursuit tracking at 4.50 
cm./sec. and responding at the same time to 
12 interrupting auditory signals. 

Condition T:A: Pursuit tracking at 9.00 
cm./sec. and responding at the same time to 
seven interrupting auditory signals. 

At each tracking condition, a 4.50 cm. 
preview of the course was given. Before 
performing for experimental purposes, all Ss 
performed at each experimental condition 
twice for practice purposes. The orders in 
which Ss attempted the five conditions were 
represented by the rows of a 5 X 5 latin 
square, replicated three times. Reaction 
times to auditory signals were recorded by 
the movement of a pen on the moving band 
of paper upon which the tracking course was 
drawn. Mean timing and amplitude errors 
in tracking were calculated in the manner 
described previously (Griew, 1958). 


Results and Discussion 


Mean timing and mean amplitude 
error scores in tracking are shown in 
Table 1. Table 2 shows mean audi- 
tory RT’s with associated SD’s. 
The distributions of auditory RT’s 
generating these means are shown 
in Fig. 1. 

Summarizing these results, it may 
be said (a) that the introduction of 
interrupting auditory signals has had 


TABLE 2 
MEAN AupitTory RT's: Exp. I 


Cond. | Mean Auditory RT (Sec.) | SD 
A 
TiA 
TiA 
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no effect upon timing errors in track- 
ing, (b) that the introduction of 
interrupting auditory signals has re- 
sulted in increases of amplitude error 
in tracking at both speeds, with a 
substantially greater increase at the 
higher speed, (c) that efficiency of 
responding to auditory signals has 
decreased upon the introduction of 
tracking at the same time, and (d) 
that the decreased efficiency of re- 
sponding to auditory signals while 
tracking is accompanied by different 
distributions of auditory RT's at 
Cond. T,;A and TeA. In Cond. T,A, 
the distribution is very similar to 
that in Cond. A, but shows a higher 
mean, while in Cond. T:A both a 
higher mean again, a much higher 
SD and a slightly bimodal distribution 
are shown. 

The scatters of individual auditory 
RT’s about their means in Cond. A 
and T,A are so similar as to suggest 


that individual responses to inter- 
rupting signals are all being slightly 


delayed in the second. The bimodal 
distribution in Cond. T:,A suggests 
that the additional increase in mean 
RT in this instance is due to a much 
greater delay in a proportion of the 
responses to interrupting signals. 


It is probably true that in higher 
speed tracking, interrupting signals are 
more likely to occur at a time when 
signals from the track (i.e., changes of 
direction) are being dealt with, and that, 
in consequence, their responses are 
likely to be sometimes greatly delayed. 
These delays are probably reflected in 
the cluster of high RT’s in the bimodal 
distribution of Cond. T,A results. The 
absence of high RT’s in Cond. T,;A 
suggests that responses to auditory 
signals are probably tending not to be 
delayed because of interference by signals 
from the track. The general increase 
in auditory RT’s in this condition is 
difficult to explain without recourse 
to a concept of “set to respond”’ pri- 
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marily to signals from the track, with 
its implication that a greater degree of 
attention would be given by Ss to the 
track, with the result that responses 
to auditory signals would, in general, 
all be slightly delayed. This would also 
account for the general increase in 
auditory RT in Cond. T:A which have 
not been delayed by signals from the 
track. 

On the basis of the foregoing discus- 
sion, two predictions may be made. 
First, at the lower speed, tracking per- 
formance would be expected to deteri- 
orate only very slightly. This is be- 
cause, with the possibility of anticipa- 
tion given by a preview of 1 sec., and 
with attention being directed mainly 
at the track, it is unlikely that interrupt- 
ing signals would interfere with indi- 
vidual responses to signals from the 
track. A nonsignificant increase in mean 
timing error, and a small but significant 
increase in mean amplitude error are, in 
fact, observed at the lower speed (see 
Table 1). 

Secondly, at the higher tracking 
speed, the probability of signals from 
the track arriving while interrupting 
signals are being dealt with is higher. 
If a signal from the track arrives while 
a response to an interrupting signal is 
being initiated, it is likely that the track 
signal will not be dealt with until its 
response is almost due. By the time 
the response is made, it may be that the 
signal will have disappeared from view, 
and errors in matching would not then 
be adjustable. This should result in a 
much higher mean amplitude error than 
otherwise would have been the 
The results given in Table 1 
this expectation. Further, gross errors 
in matching amplitude of course “swings” 
would be expected when interrupting 
signals had just occurred. 

In order to test this last prediction, 
the amplitude error exhibited at each 
course ‘“‘swing’’ was divided by the 
amplitude of the “swing,”’ to give an 
indication of S’s accuracy in matching 
amplitudes. This was done for each S, 
at each course “‘swing’’ at each tracking 


case. 
confirm 
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TABLE 3 


THe COINCIDENCE OF INTERRUPTING SIGNALS 
AND Gross ERROR IN MATCHING 
AMPLITUDE OF COURSE 
“Swincs” (N = 17) 


Number of 
Gross Errors 
Coincident 
With Inter- 
rupting 
Signals 


Number of 
Gross Errors 


Cond. in Matching 

















condition. The results of this analysis 
are shown in Table 3. 

a 
A 
amplitude error to amplitude of course 
“swing.’’ High values of this expression, 
which indicate gross inaccuracies in 
matching amplitudes, occur most fre- 
quently in Cond. T:A, and a high pro- 
portion of them coincide with points 
at which interrupting signals occur. 


In this table, indicates the ratio of 


EXPERIMENT II 


Although, on the basis of the 
results of Exp. I, it seems unlikely 
that the small increase in amplitude 
error in lower speed tracking with 
interrupting signals is due to inter- 
ference between signals from the 
two sources, no indication is readily 
available as to the actual determinant 
of this small performance decrement. 
The most likely explanation is that 
there exists a secondary “set to 
respond”’ to interrupting signals which 
results in the occasional fluctuation 
of attention from the course to be fol- 
lowed to the source of interrupting 
signals. To test the hypothesis that 
a secondary ‘‘set” exists, tracking 
performance alone was compared with 
tracking performance in which Ss 
were warned to expect interrupting 
signals but in which no interrupting 
signals were, in fact, given. 
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Method 


Apparatus and subjects—The apparatus 
was the same as that used in Exp. 1. Sixteen 
Ss took part. They were all right-handed, 
and were drawn from a similar population 
as were Ss in Exp. I. Their ages ranged 
from 25 to 47 yr. Such age differences as 
existed did not affect the issue under con- 
sideration and so the results of all Ss are 
taken together. 

Design.—Four experimental conditions 
were studied. They corresponded to Cond. 
Ti, Tz, T:A, and TA of Exp. I. The first 
two were identical to T,; and T:, representing 
tracking alone, and will be similarly desig- 
nated T, and T;. The last two differed from 
T,A and T,A in Exp. I in that, although Ss 
were warned to expect occasional interrupting 
signals, only one occurred in each of the corre- 
sponding conditions in this experiment. This 
was given, in both conditions, 12 sec. from the 
start of trials, and was intended to convey to 
Ss the impression that interrupting signals 
were, in fact, a serious possibility. The 
conditions corresponding to T,A and T:A 
will be designated Cond. T,A’ and T,A’. 
in both experiments, instructions given to Ss 
were identical in the corresponding experi- 
mental conditions. 

Before performing for experimental pur- 
poses, two practice trials at each condition 
were given to each S. During practice trials 
at T,A’ and T,A’, four interrupting signals 
were given, again to reinforce the impression 
that they were a serious possibility. The 
orders in which Ss attempted the four condi- 
tions were represented by the rows of a 
4 X< 4 latin square, replicated three times. 
Two Ss were drawn from each of eight 
departments of the factory in which the 
study was undertaken, and Ss from the same 
department performed immediately following 
one another, in order to minimize discussion 
of the experiment. 


Results and Discussion 


Table 4 shows mean timing and 
mean amplitude errors at the various 
conditions. 

This experiment is concerned basi- 
cally with the effect on pursuit track- 
ing performance of the possibility of 


interrupting signals. It is clear that 
this possibility does not affect timing 
errors. The increase in mean ampli- 
tude error at the lower course speed 
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TABLE 4 
TRACKING Scores: Exp. II 


Mean Timing 
Error (Sec. cs 


Mean Amplitude 
Error (Cm.) 





28 (P = .05) 
r (P = .05) 


TAY a) (P = ns) ) | 25 


TAY Bs = 05, (P = ns) 


when interrupting signals are possible 
is similar to that reported in the first 
experiment when interrupting signals 
were, in fact, given. At the higher 
tracking speed, the increase due to 
the possibility of interrupting signals 
is only about one-third of that ob- 
served in the previous experiment 
to be associated with the inclusion 
of interrupting signals. 

It seems fairly clear that the “‘set 
to respond” to interrupting signals 
is sufficient to result in a decrement 
in pursuit tracking performance, which, 
while being slight, is nevertheless 
significant. 


SUMMARY 


Two experiments are reported. The first 
is concerned with tracking and responding 
simultaneously to interrupting auditory sig- 
nals. Deterioration in efficiency at both 
tasks is observed when they are undertaken 
simultaneously. The manner in which these 
results are generated is discussed, and they 
are explained in terms (a) of the interference 
of signals from the two sources at the higher 
tracking speed and (6) of a “set to respond” 
primarily to signals generated by the tracking 
course. A subsidiary “set to respond” to 
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interrupting signals is suggested by a slight 
but significant decrement in tracking per- 
formance at the lower speed. Experiment I! 
investigates this by comparing tracking alone 
with tracking with the possibility of inter- 
rupting signals but when none are in fact 
given. A slight but significant decrement in 
tracking suggests that a “set to respond” 
is responsible for a proportion of performance 
decrement in tasks where interrupting signals 
are given. 
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EFFECTS OF INVERSION OF THE VISUAL FIELD ON 
HUMAN MOTIONS! 
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The effects of inversion of the visual 
field were first investigated system- 
atically by Stratton (1896, 1897, 
1899), who used himself as a subject. 
Stratton reported especially on the 
difficulties of writing during inversion 
of the visual field. Recently interest 
has been revived in the problem by 
the work of Kohler? (1953), who has 
claimed that some of the difficulties 
in earlier studies can be overcome by 
requiring Ss to wear reversing or 
inverting optics for many weeks 
rather than for only a few days. 
Kohler’s work adds to earlier objec- 
tive observations of Brown (1928), 
Ewert (1930, 1936), and Wooster 
(1923) in suggesting that readjust- 
ment to perceptual reversal is progres- 
sive, and is dependent on prior motor 
readjustment. In addition, studies 
have been conducted by Foley (1940), 
Hess (1956), and Kohler (1953) on 
the monkey and on the chicken, which 
suggest that readjustment to inver- 
sion or displacement of the visual 
field may be more difficult for lower 
animals, particularly the chick, than 
it is for man. 

The one general finding which has 
come out of the human studies is 
that S can adjust somewhat both 
perceptually and motor-wise either 
to inversion or to right—left reversal 
of the visual field. The studies of 


' This research was supported by funds 
supplied by the National Science Foundation 
for a project on perception and human motion 
and in part by funds assigned by the Graduate 
School, University of Wisconsin. 

*We are indebted to J. J. Gibson for 
supplying us with a mimeographed transla- 
tion of the work of Kohler & Erisman. 


both Ewert (1930, 1936) and Kohler 
(1953) suggest that this adjustment 
is not general, but is specific to task 
situations—that visual, tactual, and 
kinesthetic perception becomes con- 
sistent only after practice in and 
during performance of particular ma- 
nipulative acts with objects or situa- 
tions. The visual field is reorganized 
in relation to specific situations and 
performance in these situations. Ac- 
cording to Kohler (1953), skill in 
performance must be achieved with 
the distorted vision before the new 
perceptual organization takes place. 
Although Kohler implies that per- 
formance in reversed or inverted 
visual fields may be equal to that 
found in normal vision, this conclu- 
sion is not clearly established by his 
or any other investigations. The 
effects of inversion may persist for a 
short time after the inverting optical 
systems are removed from the eyes. 

This study is an attempt to develop 
new approaches to the problem of 
perceptual inversion. Instead of de- 
pending on subjective judgment of Ss, 
scientific methods of motion analysis 
are used to assess performance with 
inverted vision. We have also at- 
tempted to achieve a relatively com- 
plete experimental design in observing 
important variables related to inver- 
sion. That is to say, a systematic 
theory has been used to plan the 
determination of the effects and 
interactions of task complexity, in re- 
lation to various conditions of visual 
and kinesthetic inversion. Finally 
we have eliminated secondary sources 
of optical error usually occurring in 
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inversion experiments due to the 
wearing of lenses or prisms. The Ss 
in this study viewed an inverted 
visual field through a fixed prism 
which produced no distortion of the 
visual field nor any movement of the 
visual field when the head is moved. 
On the basis of the general theory 
of human motions used in planning 
the study, certain predictions were 
made regarding the relative effects 
of kinesthetic inversion and visual 
inversion on the travel and manipula- 
tive movements of handwriting in 
handwriting tasks of different com- 
plexity. 


METHOD 


Apparatus.—In order to measure per- 
formance under the conditions of perceptual 
inversion, use was made of recently developed 
electronic methods of motion analysis (Smith 
& Bloom, 1956; Smith & Wehrkamp, 1951). 
Figure 1 illustrates the application of these 
methods to measurement of handwriting, as 
investigated in this study. The S is seated 
in front of an adjustable writing board over 
which a viewing aperature is mounted. He 
peers through this aperature in order to 
observe the movements of his hand in writing. 
To invert the visual field, a long right-angle 
(tank-type) prism is inserted in the aperature 
and S is required to view his performing hand 
through this fixed prism. A clear undistorted 
inverted visual field is obtained which does 
not move when S moves his head or eyes. 

The S writes three separate letters or 
patterns—a series of dots (....), a series of 
triangles (A A A A), and a series of “a's” 
(aaaa). The letters and patterns are 
written within a marked-off band of electri- 
cally conductive paper (Teledeltos* recording 
paper) which can be fed through the paper 
drive of the writing board. A_ metallic 
pencil containing electrographic lead is used. 

As diagrammed in Fig. 2, a subthreshold 
current which passes through S’s body is 
keyed on and off by the contact of his pencil 
with the writing surface. Each time the 
pencil touches the paper it closes a relay 
circuit which operates a precision time clock 
and thus records the duration of this contact 

* The Teledeltos Recording Paper used in 
this research is manufactured by the Western 
Union Telegraph Co. 


wave 
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Fic. 1. The arrangement of the com- 
ponents of the electric handwriting analyzer. 
The writing apparatus, along with the 
removable panel containing the prism, is 
shown to the left. The relay and timing 
apparatus, shown to the right, are located 
on the opposite side of the experimental 
room. 


(manipulative) movement in writing. When 
the contact is broken by movement of the 
pencil to the next letter or pattern, the relay 
automatically activates a second clock which 
records the duration of this travel movement. 
As S writes each letter or pattern in a series, 
the apparatus automatically times separately 
and sums the two component movements 
of contact and travel. When the pencil 
comes to the edge of the paper, S touches a 
stop plate with his pencil and automatically 
stops the timing of the series of movements. 


MANIPULATION 
TIME 


TRAVEL 
TIME 
































Fic. 2. 


The principle of the electronic 
handwriting analyzer. 
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Procedure.—F our conditions of visual and 
kinesthetic perception were used in the 
investigation—normal vision and normal 
kinesthetic orientation (NV-NKF), normal 
vision with inverted kinesthetic feedback 
(NV-IKF), inverted vision with normal 
kinesthetic feedback (IV-NKF), and inverted 
vision with inverted writing (IV-IKF). 
Four groups of 6 Ss each practiced 10 days 
in writing the three patterns of different 
complexity: dots, ‘‘a’s,” and _ triangles. 
University students were used as Ss and were 
rewarded for their performance by being 
assigned 10 points toward their final grade 
in elementary psychology. 

On each day each of the patterns was 
written three times in three successive trials. 
Eight characters were written per trial. 
Practice effects were counterbalanced by 
having each S write the three symbols in a 
different sequence, ‘such as dot, triangle, 
“a” or triangle, dot, “a.” All six possible 
sequences were used for each of the four 
groups of six Ss. 

In treating the data, the median contact 
time and median travel time of the three 
practice trials for each symbol on each day 
were used as the basis of all computations. 
Comparisons were then made between the 
performance of each group of Ss in the vari- 
ous experimental conditions on all of the 
symbols written or drawn. Analyses of 
variance of Day 1 performance and of Day 
10 performance on each symbol and for 
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RESULTS 


Figures 3 and 4 describe the facts 
about adjustment to different condi- 
tions of perceptual inversion. Figure 
3 gives the data for contact move- 
ments in the three task situations, 
and Fig. 4 the data for travel move- 
ments. The curves represent the 
duration of the component movements 
on successive days of practice. The 
parameter in each set of curves is the 
four conditions of perception used— 
NV-NKF, NV-IKF, IV-NKF, and 
IV-IKF. These curves show that 
the different conditions of perceptual 
inversion produced systematic varia- 
tions in the duration of the contact 
and travel movements in writing. 
These variations are especially note- 
worthy in the writing of ‘“a’s” and 
triangles. The effects of inversion 
in tasks of different complexity are 
clearly seen. There are only limited 
effects of inversion on the writing 
of dots, an intermediate effect on 
writing ‘‘a’s,”’ and a maximum effect 
on writing A’s. 

These curves suggest that, initially, 
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Learning curves for manipulative (contact) movements in writing during 


readjustment to inverted}perceptual fields. 
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Fic. 4. Learning curves for travel movements in writing during readjustment 
to inverted perceptual fields. 


than it is for manipulative move- 
ments. The curves for travel move- 
ments in the triangle task show rela- 
tively faster initial drop than those 
found for manipulative movements. 
Initially, the travel movements for 


the dotting task are more affected 
by the inversion than the contact 
movements. 

There is an orderly variation of the 
effects of the four different conditions 


of inversion on both the contact 
and travel movements. The com- 
bined visual-kinesthetic inversion (IV- 
IKF) produces the most serious 
disturbance of movement with less 
serious effects appearing in the more 
limited conditions of inversion (NV- 
IKF and IV-NKF). Inverting the 
kinesthetic feedback of the writing 
motions (NV-IKF) has less effect 
on the movements than inverting 
the visual field (IV-NKF). 

The significance of differences due 
to perceptual conditions has been 
assessed by means of analysis of 
variance* and Duncan (1955) Range 

‘Six tables showing means of median 


manipulation and travel times by days (10), 
conditions (4), and task (3) and summaries 


tests made on the data of Day 10. 
Separate analyses were carried out 
for the contact and travel data. 
Results of these tests show that 
significant differences in performance 
exist especially between the condition 
of combined visual and kinesthetic 
inversion (IV-IKF) and the other 
conditions of perceptual inversion. 
For both contact and travel move- 
ments, significant differences in writ- 
ing “a’s’’ and triangles are found 
between the conditions of combined 
inversion (IV-IKF) and inverted writ- 
ing (NV-IKF). Generally on this 
tenth day significant differences do not 
occur between the two conditions of 
visual inversion. 

Similar range tests carried out for 
the data on Day 1 give results some- 
what different from those for Day 10. 
For Day 1 manipulation times, dif- 
ferences are found mainly between 
the combined visual-kinesthetic in- 
version and the other three conditions. 
For Day 1 travel times, differences 


of analyses of variance and trends are on file 
with the American Documentation Institute. 
Order Document No. 5846, remitting $1.25 
for photocopies, or $1.25 for microfilms. 
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are found mainly between the two 
most severe conditions of inversion 
(IV-NKF) and (IV-IKF) and the 
other two less severe conditions of 
inversion. 

Task complexity was varied in 
this experiment by having Ss write 
dots, ‘‘a’s,’’ triangles. Duncan (1955) 
Range tests were computed for the 
differences in performance at the 
three levels of task complexity for 
all four conditions of perception used 
in the experiment. The results of 
these tests, made on the data for 
Day 10, show, that for contact time 
significant differences between the 
dotting task and the other two tasks 
are found on both conditions of 
inversion of the visual field (IV-NKF 
and IV-IKF). A similar difference 
in travel time is found only for the 
visual inversion condition (IV-NKF). 
The absence of differences between 
“a"’ and triangle tasks at particular 
perceptual conditions does not mean 
that the writing of dots, triangles, 
or “a's” is unaffected by visual and 
kinesthetic inversion, but that the 
three tasks were not differentially 
affected in a significant way under a 
particular perceptual condition. 


DISCUSSION 


Results of this study show that the 
initial and final adjustment to perceptual 
inversion depends upon the sensory 
mode and degree of such inversion, upon 
the component movements in the in- 
verted performance, and upon the com- 
plexity of the task situation. With more 
complex tasks, both the initial and final 
effects of inversion are more severe. 
Adjustive recovery to visual inversion 
in more complex tasks is not complete 
after practice extending over 10 days. 

The results of this experiment conform 
in detail to predictions based on the 
theory of space-structured movements 
in human motion (Smith, 1954). This 
theory interprets the adjustment to 
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inverted vision and/or kinesthetic inver- 
sion as a matter of neuro-motor coordina- 
tion of these systems in relation to pri- 
mary organization of posture in the 
body. This theory would imply (a) 
some differential effects of inversion on 
the travel and manipulative movements 
in motion, (b) increase in effects of 
inversion on tasks of increasing com- 
plexity, and (c) greater effects of inver- 
sion with increase in amount and scope 
of such perceptual change. All of these 
effects were observed in the experiment 
in the manner predicted by the theory. 
It is to be noted that a theory based on 
reinforcement principles alone, or upon 
principles of general perceptual organiza- 
tion cannot be articulated to deal sys- 
tematically with the variables of inver- 
sion specifically interpreted in this study. 

The results found here show that the 
process of readjustment to perceptual 
inversion involves considerable speci- 
ficity of the response in relation to move- 
ment components involved, in relation 
to level of task difficulty and in relation 
to the type and degree of inversion. 
This high degree of specificity of readjust- 
ment in the space organization and 
reorganization of motion under percep- 
tual inversion is a clue to the speci- 
ficity of psychomotor actions in general. 
We believe that the theory of space- 
structured movements in motion, which 
defined the plan and interpretation of 
this study, to be essential in giving an 
account of the specificity of perceptual 
motor performance under both normal 
conditions and under conditions of 
perceptual inversion. 


SUMMARY 


Controlled experimental study involving 
assessment of the movements components in 
performance, and variation in the level of 
task difficulty and the type and degree of 
inversion, has been carried out to determine 
the effects of perceptual inversion on hand- 
writing. Results show that readjustment 


to inversion is specific to all three aspects 
of performance investigated, as : predicted 
by the theory of space-structured motion 
used to design and plan the study. In 
addition, the data and theory of the experi- 
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ment give a clue to the specificity which has 
been found to be characteristic of all psy- 
chomotor performance. 
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THE SENSITIVITY OF THE EYE TO TWO KINDS OF 
CONTINUOUS TRANSFORMATION OF A 
SHADOW-PATTERN ! 


KAI VON FIEANDT 
University of Helsinki 
AND JAMES J. GIBSON 
Cornell University 


Many years ago, the French mathe- 
matician Poincaré (1913, p. 250 ff.) 
put a psychological question which 
has not yet been answered by psy- 
chologists: How does the eye dis- 
tinguish between the change of posi- 
tion of a thing and the change of 
state of a thing? By “position,” he 
meant the distance and orientation 
of an object in the environment, 
which can change when either the 
observer or the object moves. By 
“state,” he meant the identity or 
rigidity of an object in the environ- 
ment, which can change when the 
object grows or changes shape (even 
to the extent of becoming a different 
object). If vision presents to the 
eye only a projection or silhouette 
of the object, how does the eye 
discriminate between the perspective 
alterations of this projection and the 
alterations arising from real changes of 
the object? For example, how can 
the observer tell the difference be- 
tween approach and growth, that 
is, between the object getting nearer 
and the object larger? How can 
he tell the difference between a 
turning motion of the object and a 
deformation of the object when, in 
both cases, its silhouette is compressed 
in one dimension? On the one hand, 


1 This experiment was performed at Cornell 
University where the first author was visiting 
on a grant from the U. S. Department of 
State. The first author designed the ap- 
paratus and carried out the experiment. 
The second author is responsible for the 
theory. We are indebted to Paul Olum of 
Cornell University for advice in the field of 
geometry. 


the silhouettes or visual sensations. 
of objects are transformed when we 
move from one point of view to 
another, or when rigid objects them- 
selves change position relative to the 
point of view. On the other hand, 
they are transformed when fluid 
or elastic objects change with respect 
to size or shape—a cloud or a surface 
of living tissue. But the changes of 
relative position are not confused in 
perception with the changes of struc- 
ture or state. Men and animals 
seem to distinguish the rigid and 
permanent surfaces of the environ- 
ment from the elastic or animate 
surfaces of the environment with 
considerable success. But why do 
both not look indifferently elastic since 
both may undergo transformation ? 
Poincaré, being a geometer, realized 
that the two kinds of transformations 
belong to different “groups.” He 
thought that a man might learn to 
distinguish a change of position from 
a change of state by virtue of the 
fact that the first kind of transforma- 
tion can always be reversed by a 
locomotor movement of the observer 
himself whereas the second kind of 
transformation cannot. That is, the 
observer can always move to a point 
of view which nullifies the first kind 
of transformation and re-establishes 
the original sensation and he can 
learn to associate his movement with 
the change of sensation. This is an 
interesting theory. However, a less 
speculative and more direct explana- 
tion is possible. It would be that an 
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eye, for reasons to be investigated, 
is sensitive to a change of form, and, 
moreover, to the type of a change of 
form. In this respect, an eye is to 
be contrasted with a camera. The 
eyes of animals and men are very good 
at detecting motions; perhaps they 
are just as good at discriminating 
types of motions. It might be that 
the one group, of continuous trans- 
formations, thé perspectives of rigid 
objects, constitutes one kind of stimu- 
lus for vision and that the other 
group, the “rubbery” transforma- 
tions, constitutes another kind of 
stimulus for vision. 

This hypothesis gains plausibility 
from the results of an experiment by 
Gibson and Gibson (1957) in which 
it was shown that a continuous per- 
spective transformation (with polar 
projection) is a sufficient stimulus for 
an impression of rigid motion in 
space.2 The transformation was car- 
ried by a pattern of shadows on the 
translucent screen of a point-source 
shadow projector (Gibson, 1957). 
The apparent surface seemed to 
rotate in the space behind the screen 
—that is, to change its position with- 
out change of its structure. But the 
experiment did not test the prediction 
that a continuous transformation of 
the other kind would give an impres- 
sion of non-rigid motion—that is, 
a change of structure without change 
of position in space. The aim of the 
present experiment was to compare 
the perceptions induced by the two 
kinds of transformation sequences, 
everything else being equal so far as 
possible, and to determine whether 
corresponding impressions of rigid 
and of non-rigid motion would occur 
spontaneously. 


2 This research is allied to the stereo- 
kinetic phenomenon of Benussi and Musatti 
(1924) and to the kinetic depth effect of 
Wallach, and his collaborators (e.g., Wallach 
Weisz, & Adams, 1956). 
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Fic. 1. The shadow-pattern. Note that 


the network is irregular and unfamiliar, and 
that a linear perspective cue is not available 
for the perception of slant. The change of 
pattern is indicated crudely by the arrows 
indicating displacements of elements. In 
the case of the perspective transformation, 
however, there is a vertical component of 
displacement as well as the horizontal 
component. 


METHOD 


Apparatus.—A point-source shadow pro- 
jector is designed to produce an ongoing 
perspective transformation. It was necessary 
to modify the apparatus so that it would 
also produce an ongoing transformation of 
the “rubbery” kind, and so that it could be 
shifted from the first to the second kind of 
transformation without any realization by O 
that the apparatus has been changed. 

For a decisive test, the two kinds of motion 
should be superficially similar but not mathe- 
matically similar. It was decided to fill 
the whole rectangular screen with a pattern 
(an irregular network of dark lines) which 
could be compressed from the right to the left 
of the window and then decompressed to 
restore the original pattern. There was to 
be always, in short, a gradient of leftward 
velocities of the elements of the pattern from 
a maximum on the right to zero on the left 
and then the same gradient of rightward 
velocities (Fig. 1). But one kind of compres- 
sion of the image was to be the projection of a 
rigid surface which had been made to turn, 
whereas the other kind was to be the projec- 
tion of an elastic surface which had been 
made to contract. 

The shadow-caster behind the screen was 
therefore constructed in a special way. It 
consisted of a net of elastic thread stretched 
on a rectangular frame whose edges were 
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projected outside the frame of the window 
and were therefore never visible. The E 
could either rotate the frame around a ver- 
tical axis at its left-hand edge or allow the 
right-hand edge to slide inward on a pair of 
guides. Both motions were silent. At the 
top and bottom of the frame the net was 
attached by rubber bands so that the sliding 
motion. was distributed over the whole area 
of the net. During rotation the slide was 
locked in its outermost position; during 
contraction the frame was kept parallel to 
the plane of the window.* 

With this arrangement, a 20° rotation of 
the frame (toward the window) was equiva- 
lent to an 8-cm. inward displacement of the 
slide—equivalent with respect to the number 
of diamond-shaped elements of the shadows 
entering the window to the right. Stops 
were placed to limit the motions of the frame 
at these extremes. The E could thus, at 
will, produce either of two types of cyclical 
motion of the shadow pattern, operating the 
frame by hand at the rate of one cycle in 
4 sec. The window was 38 X 40 cm. at a 
distance of 140 cm. from O's eye, and the 
point-source was also 140 cm. behind the 
window. The frame was 39 X 47 cm. (when 
fully stretched) at a distance of 57 cm. behind 
the window (when parallel to the window). 
The O observed the window through a monoc- 
ular aperture which hid the apparatus. 

Procedure.—One way of comparing the 
perceptions aroused by two stimulus events 
is to determine whether the events are 
spontaneously named or identified with dif- 
ferent words. This is not easy, however, 
if the ordinary language of the Os does not 
include accepted words for the different 
events. Another way is to determine whether 
the events are distinguished or discriminated 
by the consistent assignment of different 
words to different events, even though the 
words do not have accepted meaning. And 
a third way is to determine whether the 
transition from the presenting of one event 
to the presenting of the other arouses a 
reaction consistent with the direction of the 

§ The transformation of the shadow during 
lateral contraction of the frame was actually 
an approximation to the limiting case of a 
perspective transformation with parallel pro- 
tection instead of polar protection. ‘That is, 
it was similar to (but not wholly identical 
with) the motion which the shadow would 
have undergone if the frame had been rotated 
with the light source at a very great distance. 
This is a transformation which converts a 
square into a rectangle. With polar projec- 
tion a square is converted into a trapezoid. 
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transition. All three methods were used 
in the present experiment. 

A naive O was brought into the dimly 
lighted experimental room, seated, and told: 

“When you look through this tube you 
will see something. Please describe what 
you think it might be. When it begins to 
move, say what kind of movement it is. 
If the kind of movement changes, what new 
kind is it?”’ 

The shadow of the net was motionless for 
a few seconds, and then cycles of motion were 
presented. Half the Os began with con- 
traction and half with rotation. After three 
cycles (about 12 sec.) the other kind of motion 
was introduced without interruption for 
three cycles, and this alternation was con- 
tinued until each kind of motion had been 
presented three times. During these six 
presentations O was freely reporting. This 
was the first part of the experiment, intended 
to find out if the verbal descriptions fell into 
two categories, and whether or not these 
could be taken to mean rigid and non-rigid 
motion. No suggestions were given at any 
time. 

Nearly all Os (20 out of 22 tested) could 
be considered to have distinguished two 
classes of motion, although they had dif- 
ficulty in finding words to describe what they 
saw. Using the terms spontaneously used 
by O, this group was then given the further 
instructions: 

“From now on always use these names 
when you see the corresponding movements. 
Report whenever the movement changes from 
one to the other.”’ 

There followed 50 three-cycle presenta- 
tions, 25 of contraction and 25 of rotation in 
a prearranged random order, half the Os 
beginning with one and half with the other. 
There were pauses after each trial so that the 
verbal identification could be recorded by E. 
This was the second part of the experiment. 

Finally, a continuous series of motion- 
cycles was presented consisting of 47 transi- 
tions from contraction to rotation or the 
reverse, each transition being separated from 
the preceding by either three or four cycles of 
motion, these two possibilities being ran- 
domized. A record was kept of the changes 
reported or missed.- 

As an extra guarantee against the use of 
auditory cues for distinguishing the motions 
of the shadow-caster, a moderate masking 
noise was produced with a tape recorder 
placed near O. 

Subjects—Twenty Os completed the ex- 
periment, of ‘which 18 were undergraduate 
students with little or no training in psy- 
chology and two were psychologists. 
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RESULTS 


The motionless shadow of the net 
was always seen as something in the 
frontal plane. The subsequent cycles 
of movement were described in a 
variety of terms whose meaning had 
to e interpreted. But the reports 
clearly fell into two types, and the 
change from one to the other move- 
ment of the shadow-caster was prompt- 
ly noticed and reported. 

The conclusion that the two kinds 
of transformation aroused two differ- 
ent perceptions is also warranted by 
the second part of the experiment. 
Out of 1000 judgments required of the 

Os, 979 or 98% were scored as 
correct, in that they consistently 
applied the terminology developed 
in the first six presentations. 

The conclusion that the transitions 
were immediately responded to is 
proved by the third part of the experi- 
ment. Of 940 changes from rotation 
to compression or the reverse, all 
were responded to, and only one O 
on one occasion reported a change 
in the motion when none had occurred. 


May we conclude that the Os spon- 
taneously saw the same events which the 
Es saw in response to the two kinds of 
transformations? These events were, on 
the one hand, a rigid turning movement 
of an apparent surface in the space 
behind the window, and, on the other, an 
elastic or squeezing movement of an 
apparent surface, usually without change 
of depth but sometimes with a vague 
impression of it. The rigid rotation in 
depth could, with a shift of attitude, be 
seen as a compression, but this was not the 
same kind of compression as the elastic 
movement. The experience of compres- 
sion obtained with the perspective trans- 
formation was, moreover, unstable. 

The Os did not use the above terms to 
describe the events in question. They 
used familiar words like ‘‘wire fencing,” 
“swinging toward me,” “back and forth,” 
“compression,” ‘“‘diagonal movement,” 
“horizontal movement,”’ and the like. 
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The interpretation of such words is 
subject to error. Nevertheless the judg- 
ment of the authors is that Os in this 
experiment perceived essentially the 
same events. The perceptions differed 
in lacking the precision which can be 
achieved by using such terms as “rigid” 
and “elastic.” 

It is clear that, by two objective 
criteria, unsophisticated Os can dis- 
tinguish between a continuous perspective 
transformation and a non perspective trans- 
formation similar to it with respect to the 
gross pattern of displacements of ele- 
ments. The visual system is sensitive 
to different kinds of change of projected 
form, at least in adults. This fact would 
explain the puzzle of Poincaré—why we 
do not confuse the rigid movements of 
surfaces in depth with the elastic move- 
ments of surfaces in the frontal plane. 
We cannot conclude with certainty that 
these Os did actually perceive a rigid 
surface in one case and a non-rigid sur- 
face in the other, for the extent to which 
the abstract property of rigidity was 
induced in our experimental situation is 
not clear. Nevertheless, the words used 


spontaneously by the Os suggest a 
tendency to do so. 
SUMMARY 


Continuous perspective transformations 
of a rigid object and continuous nonper- 
spective transformations of an elastic object 
were presented to Os under three conditions 
to determine whether discrimination of 
these would occur. All tests supported the 
inference that the discrimination is very 
efficient. 
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REPLICATION REPORT: THE RELATIONSHIP OF MANIFEST ANXIETY 
AND ELECTRIC SHOCK TO EYELID CONDITIONING! 
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The present study partially replicates that 
of Spence, Farber, and Taylor (1954). 
Although apparatus was the same, certain 
changes were made in CR scoring and S 
assignment. 

A response was scored as a CR during the 
200 to 500 msec. interval after CS onset only 
if it were preceded by a 200-msec. period of no 
response. Rationale for this change was 
based on the fact that a voluntary blink 
initiated during the 200-msec. interval follow- 
ing CS onset is often carried over into the 
critical scoring range. 

Sixty college students were divided into 
above- and below-median groups on MAS 
scores (Taylor, 1953). Within each group, 
pairs of Ss matched on MAS scores were 
assigned to “‘shock"’ and ‘“no-shock’’ sub- 
groups. The data was analyzed in two blocks 
of 40 trials each and with respect to the first 
40 trials alone. This was done in two sepa- 
rate analyses. The first was a 2 X 2 X 2 
design with High vs. Low Anxiety as a 
“between-effect” and Shock-No Shock and 

















BLOCKS OF TEN TRIALS 


Percentage of CR's in blocks 
of 10 trials. 


Fic. 1. 


1 Acknowledgment is offered Leland Thune, Vander- 
bilt University, for his generous guidance. 
? Previously at Peabody College. 


Trials 1-40 and 41-80 as “within-effects” 
(Lindquist, 1953, Type VII). A 2x2 
table (Shock X Anxiety) (Lindquist Type I 
design) was analyzed with data of the first 
40 trials only. 

Figure 1 presents the curves of acquisition. 
In the first analysis a significant difference 
was found only for trial blocks, indicating a 
greater number of CR’s in the later trial 
block. No differences in learning could be 
attributed to the shock or anxiety dimensions. 
In the second analysis differences in learning 
were significantly related to differences ia 
anxiety level only. 

As in the original study, the present find- 
ings support the relationship between MAS 
and conditioning during the first 40 trials. 
The groups were not significantly different 
during the last 40 trials. Contrary to the 
original study, the present study finds no 
differences in conditioning as a function of 
shock. A further difference between the 
present and the original one seems to be the 
difference in level of asymptote. The Ss 
in the present study asymptoted approxi- 
mately at or below a 50% CR rate; whereas, 
groups in the original study attained a 
60-70% CR rate. Both findings can perhaps 
be accounted for by the difference in scoring 
procedures between the two studies. It is 
possible that the shock merely caused pro- 
longed “voluntary” blinking which was 
scored positively in the Iowa study. 

The authors are of the opinion that the 
use of a fixed interval (3 sec.) for the warning 
signal may also have given Ss an excellent cue 
to the onset of the CS (light) and thereby 
may have greatly increased the likelihood 
of the Ss voluntarily responding to the CS. 
In fact, many Ss reported upon completion 
of the session that they used the warning 
signal to avoid the puff. 

If this criticism is cogent it may help ac- 
count for the differences in the present study 
and the Iowa study in the amount of condi- 
tioning. The constant warning signal would 
have encouraged “‘guessing’’ in both studies 
and this guessing would be more sharply 
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penalized in the scoring procedure of the 
present study. Further substantiation for 
this notion comes from the fact that the 
frequency of CR’s are more similar for the 
two studies during the earlier trials than 
during the later trials. In the earlier trials 
S would not have learned adequately the 
cue value of the warning signal. Therefore, 
voluntary or guessing responses as a function 
of the signal would be less expected here than 
during the later trials. 
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SUPPLEMENTARY REPORT: DISCRIMINATION LEARNING WITH 
PROBABILISTIC REINFORCEMENT SCHEDULES! 
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This study deals with discrimination 
learning in a verbal conditioning situation 
and is an extension of research reported by 
Popper and Atkinson (1958). The study 
differs from theirs in that a new set of param- 
eter values and a more rigorously controlled 
experimental procedure were employed. Two 
stimuli, T,; and Ts, were used and two re- 
sponses, A, and As, were available to 5. 
Each trial began by the presentation, with 
equal probability, of either T; or T;. On 
a T, trial, A; was correct with the probability 
m, and A, was correct with probability 
1 —-*,. For a Tz trial, 2. was identically 
defined. 

Method.—Five groups were used. For all 
groups 7, = .9. The groups differed with 
respect to the wr: parameter which assumed 
the values of .9 (G-I), .7 (G-II), .5 (G-III), 
.3 (G-IV) and .1 (G-V). The Ss were given 
400 trials. However, on the first 40 trials 
all groups were given m, = 72 = .5. The Ss 
were 180 undergraduates, 36 per group. The 
experimental design was identical to that of 
Popper and Atkinson (1958) except that Ss 
were run in subgroups of three, each S placed 
in a private booth after instructions. The 
apparatus, viewed from within S’s booth, 
consisted of two keys attached to the base 
of a panel, upon the panel were mounted 
four small lights. Two lights were in acolumn 


1 This research was supported in part by a grant to 
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tion. 


centered above the keys and served as the 
T, and T;, stimuli. Each of the two remain- 
ing lights (the reinforcing signals) was 
mounted directly above one of the keys. 
On all trials the signal light (T; or T:) was 
lighted for 1.5 sec.; the signal light went off 
simultaneously with the onset of a reinforcing 
light. The reinforcing light remained on for 
1.8 sec., and was followed by an intertrial 
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P(Ai!T») in successive blocks of 40 trials. Each point 
is based on approximately 20 observations per S. 
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interval of 4 sec. Trials were run without 
interruption and S made his response while 
the signal light was on. Instructions were 
similar to those presented in Atkinson and 
Suppes (1958) except that they were modified 
to describe the T, and T, stimuli as in Popper 
and Atkinson (1958). 

Results.—Figure 1 presents mean response 
curves over all trials of the experiment.’ In 
this figure the proportion of A,’s made on 
T, trials, p(A,\T,), and the proportion of 
A;'s made on T; trials, p(A;|T2), are plotted 
in successive 40-trial blocks. The corre- 
sponding m values are indicated on the far 
right. The asymptotic and pre-asymptotic 
characteristics of these curves are identical 
to those found by Popper and Atkinson 
(1958). One aspect of these results to be 
emphasized is the relation between the ob- 
served p,(Ai/T;) and mw. A convex func- 
tion was found by Popper and Atkinson 
when m2 varied from .85 to .15 (for a fixed 
m, = .85) and the same relation holds for this 
study. Specifically, if the proportions com- 


puted over the last 120 trials are used as 
estimates of p,(Ai|T;), the obtained values 
are .930, .867, .808, .856, and .895 for Groups 
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I to V, respectively. The convexity was 
found significant by evaluation of the quad- 
ratic component of the treatment sum of 
squares, F = 18.2. 

The significance of these findings with 
regard to stochastic theories of discrimination 
learning is discussed by Atkinson (1958). 
In paiticular, the demonstrated convex 
relation between m: and p,(A;,|T;) suggests 
that the Burke and Estes (1957) component 
model for discrimination learning is not 
applicable in this type of situation. 
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